whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-06-01 06:50:41 +00:00

History

Eve c21fb10b28 vulkan: small mul_mat_vec optimizations (llama/10665)

* double the number of rows per workgroup

* Update ggml-vulkan.cpp

* Vulkan: Add VK_EXT_subgroup_size_control support to ensure full subgroups for coopmats

* only increase the number of rows for amd and subgroup size 64

* fix missing NUM_ROWS for mul_mat_vec_iq4_nl_f16_f32, untested

* use subgroup min and max to check for gcn (requires https://github.com/ggerganov/llama.cpp/pull/10721)

* manual merge ggml-vulkan.cpp

* set min and max subgroup size in any case

* Also double the number of rows for Intel GPUs

2024-12-18 12:52:16 +02:00

include

ggml: load all backends from a user-provided search path (llama/10699)

2024-12-18 12:52:16 +02:00

src

vulkan: small mul_mat_vec optimizations (llama/10665)

2024-12-18 12:52:16 +02:00

.gitignore

whisper : reorganize source code + improve CMake (#2256 )

2024-06-26 19:34:09 +03:00

CMakeLists.txt

remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797)

2024-12-18 12:52:16 +02:00