whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-05-31 06:20:58 +00:00

History

Diego Devesa 2a4b5c9d7e cuda : optimize argmax (llama/10441)

* cuda : optimize argmax

* remove unused parameter

ggml-ci

* fixup : use full warps

ggml-ci

* Apply suggestions from code review

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

* fix ub

* ggml : check ne00 <= INT32_MAX in argmax and argsort

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

2024-12-08 20:14:35 +02:00

include

ggml: new optimization interface (ggml/988)

2024-11-20 21:00:08 +02:00

src

cuda : optimize argmax (llama/10441)

2024-12-08 20:14:35 +02:00

.gitignore

whisper : reorganize source code + improve CMake (#2256 )

2024-06-26 19:34:09 +03:00

CMakeLists.txt

add cmake rvv support (llama/10411)

2024-12-08 20:14:35 +02:00