whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-05-31 06:20:58 +00:00

History

mahorozte 4af9626702 CUDA: remove unnecessary warp reduce in FA (ggml/1032)

* kqmax_new_j in every thread within warp is same after operate at line 199,this reduce can be omit

* same problem in vec32

---------

Co-authored-by: ZhaoXiaoYu <zhao.xiaoyu@zte.com.cn>

2024-12-08 20:14:35 +02:00

include

ggml: new optimization interface (ggml/988)

2024-11-20 21:00:08 +02:00

src

CUDA: remove unnecessary warp reduce in FA (ggml/1032)

2024-12-08 20:14:35 +02:00

.gitignore

whisper : reorganize source code + improve CMake (#2256 )

2024-06-26 19:34:09 +03:00

CMakeLists.txt

ggml : sync resolve (skip) (#0 )

2024-11-20 21:00:08 +02:00