whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-05-04 09:42:51 +00:00

History

0cc4m 4a6d52efe6 Vulkan: VK_KHR_cooperative_matrix support to speed up prompt processing (llama/10597)

* Vulkan: Implement VK_KHR_cooperative_matrix support in the matrix matrix multiplication shader

* Improve performance with better q4_k and q5_k dequant and store unrolling

* Add Vulkan MUL_MAT and MUL_MAT_ID accumulator precision selection

* Rework mulmat shader selection and compilation logic, avoid compiling shaders that won't get used by device

* Vulkan: Implement accumulator switch for specific mul mat mat shaders

* Vulkan: Unroll more loops for more mul mat mat performance

* Vulkan: Add VK_AMD_shader_core_properties2 support to read Compute Unit count for split_k logic

* Disable coopmat support on AMD proprietary driver

* Remove redundant checks

* Add environment variable GGML_VK_DISABLE_COOPMAT to disable VK_KHR_cooperative_matrix support

* Fix rebase typo

* Fix coopmat2 MUL_MAT_ID pipeline selection

2024-12-18 12:52:16 +02:00

include

ggml : remove old files (skip) (#0 )

2024-12-08 23:04:26 +02:00

src

Vulkan: VK_KHR_cooperative_matrix support to speed up prompt processing (llama/10597)

2024-12-18 12:52:16 +02:00

.gitignore

whisper : reorganize source code + improve CMake (#2256 )

2024-06-26 19:34:09 +03:00

CMakeLists.txt

ggml : add predefined list of CPU backend variants to build (llama/10626)

2024-12-08 20:14:35 +02:00