whisper.cpp/ggml
Eve 30e35d7271 vulkan: Dynamic subgroup size support for Q6_K mat_vec (llama/10536)
* subgroup 64 version with subgroup add. 15% faster

scalable version

tested for subgroup sizes 16-128

* check for subgroup multiple of 16 and greater than 16

* subgroup sizes are always a power of 2 (https://github.com/KhronosGroup/GLSL/issues/45)

* force 16 sequential threads per block

* make 16 subgroup size a constant
2024-12-08 20:14:35 +02:00
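The commit notes above first check for a subgroup size that is a multiple of 16, then observe that subgroup sizes are always a power of 2. A minimal sketch of why the second fact simplifies the first: for power-of-two sizes, "multiple of 16" and "at least 16" accept exactly the same values. The helper names below are hypothetical and are not taken from the ggml Vulkan backend; the exact condition used in the actual commit is not reproduced here.

```cpp
#include <cassert>   // only needed for runtime checks like those in the note below
#include <cstdint>

// Hypothetical helpers for illustration; not ggml code.

// True when n has exactly one bit set.
constexpr bool is_power_of_two(uint32_t n) {
    return n != 0 && (n & (n - 1)) == 0;
}

// General check: n is a non-zero multiple of 16.
constexpr bool multiple_of_16(uint32_t n) {
    return n >= 16 && n % 16 == 0;
}

// Simplified check, equivalent to the one above only for power-of-two inputs.
constexpr bool at_least_16(uint32_t n) {
    return n >= 16;
}

// Walks every power of two representable in 32 bits and compares both checks.
constexpr bool checks_agree_on_powers_of_two() {
    for (uint32_t n = 1; n != 0; n <<= 1) {
        if (multiple_of_16(n) != at_least_16(n)) {
            return false;
        }
    }
    return true;
}

// Verified at compile time (requires C++14 or later for the constexpr loop).
static_assert(checks_agree_on_powers_of_two(),
              "checks must agree for all power-of-two sizes");
```

The two checks differ only on sizes that are not powers of two: 24, for instance, satisfies `at_least_16` but not `multiple_of_16`. If subgroup sizes are guaranteed to be powers of two, such values never occur and the cheaper comparison is enough.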
include         ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541)          2024-12-08 20:14:35 +02:00
src             vulkan: Dynamic subgroup size support for Q6_K mat_vec (llama/10536)  2024-12-08 20:14:35 +02:00
.gitignore      whisper : reorganize source code + improve CMake (#2256)              2024-06-26 19:34:09 +03:00
CMakeLists.txt  ggml : add support for dynamic loading of backends (llama/10469)      2024-12-08 20:14:35 +02:00