whisper.cpp/ggml
Shanshan Shen 9a5ef7b169 CANN: Improve the Inferencing Performance for Ascend NPU Device (llama/10454)
* improve inferencing performance for ascend npu.

Co-authored-by: Frank Mai <thxCode@thxcode0824@gmail.com>

* some modification after review

* some modifications after review

* restore some modifications

* restore some modifications

---------

Co-authored-by: shanshan shen <shanshanshen333@gmail.com>
Co-authored-by: Frank Mai <thxCode@thxcode0824@gmail.com>
2024-12-08 20:14:35 +02:00
include         ggml : add support for dynamic loading of backends (llama/10469)               2024-12-08 20:14:35 +02:00
src             CANN: Improve the Inferencing Performance for Ascend NPU Device (llama/10454)  2024-12-08 20:14:35 +02:00
.gitignore      whisper : reorganize source code + improve CMake (#2256)                       2024-06-26 19:34:09 +03:00
CMakeLists.txt  ggml : add support for dynamic loading of backends (llama/10469)               2024-12-08 20:14:35 +02:00