whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-05-28 21:14:11 +00:00

History

Ivan 2fc1d20f9e cuda: add q8_0->f32 cpy operation (llama/9571)

llama: enable K-shift for quantized KV cache
It will fail on unsupported backends or quant types.

2024-09-24 19:45:08 +03:00

2024-06-26 19:34:09 +03:00

2024-09-24 19:45:08 +03:00

2024-09-24 19:45:08 +03:00

.gitignore

2024-06-26 19:34:09 +03:00

CMakeLists.txt

2024-09-24 19:45:08 +03:00

ggml_vk_generate_shaders.py

2024-06-26 19:34:09 +03:00