whisper.cpp/ggml
Eve d8bf63a41b vulkan: dynamic subgroup size for the remaining k quants (llama/10745)
* q5_k

q4_k

q3_k

q2_k

q6_k multi row example

* revert as multi row isnt faster for k quants
2024-12-18 12:52:16 +02:00
..
include ggml : refactor online repacking (llama/10446) 2024-12-18 12:52:16 +02:00
src vulkan: dynamic subgroup size for the remaining k quants (llama/10745) 2024-12-18 12:52:16 +02:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt ggml : add predefined list of CPU backend variants to build (llama/10626) 2024-12-08 20:14:35 +02:00