whisper.cpp/examples
Georgi Gerganov 93935980f8
whisper : Metal and ggml-alloc support (#1270)
* metal : init

* whisper : factor out graph builds

* whisper : allocate encoder and decoder using ggml-alloc

* whisper : ggml-alloc is now supported

* whisper : CoreML support ggml-alloc

* build : fix ggml-alloc

* ios : update submodule

* extra : update sync-ggml.sh script to also sync ggml-alloc

* ci : see if this is causing the crash

* whisper : refactor ggml-alloc init

* whisper.android : try to fix build

* whisper : initial Metal version

* ci : try to debug vmem issue

* metal : decoder works on GPU!

* metal : add multi-decoder support

* ggml : fix ggml_nbytes (probably temp solution)

* metal : run "cross" step on the GPU

* whisper : remove ggml_repeat in the encoder

* whisper : offload the Encoder to Metal

* ggml : use simpler ggml_bytes() implementation

* ggml-alloc : try to make CI happy by reducing vram to 128GB

* whisper : add whisper_allocr to wrap ggml_allocr

* whisper : factor out alloc init in a function

* cmake : update to support Metal build

* whisper : add <functional> header

* objc : fix build (no Metal yet)

* ios : add Metal support

* swiftui : fix build

* metal : speed-up KQ multiplication

* metal : sync latest llama.cpp kernels

* readme : add Metal info

* ios : update submodule

* coreml : add code to toggle Core ML config (CPU, ANE, GPU)

* bench : fix timings by running a pre-heat

* bench : start benching the decoder

* whisper : add ggml_mul_mat_pad

* bench : fix uninitialized vars

* whisper : add comment for disabling mul-mat padding

* whisper : add description of ggml_mul_mat_pad

* whisper : clean-up ggml_mul_mat_pad

* metal : remove the "concurrent" flag

* bench : variable n_past

* ios : update SPM package
2023-09-15 12:18:18 +03:00
| Name | Last commit | Date |
| --- | --- | --- |
| addon.node | whisper : add integer quantization support (#540) | 2023-04-30 18:51:57 +03:00 |
| bench | whisper : Metal and ggml-alloc support (#1270) | 2023-09-15 12:18:18 +03:00 |
| bench.wasm | whisper : add integer quantization support (#540) | 2023-04-30 18:51:57 +03:00 |
| command | build : do not use _GNU_SOURCE gratuitously (#1129) | 2023-09-07 12:36:14 +03:00 |
| command.wasm | examples : fix + refactor Levenshtein distance | 2023-04-30 19:12:49 +03:00 |
| lsp | examples : fix build + compile warnings (close #1256) | 2023-09-07 12:33:12 +03:00 |
| main | examples : fix build + compile warnings (close #1256) | 2023-09-07 12:33:12 +03:00 |
| quantize | quantize : fix load vocab crash when len is 128 (#1160) | 2023-08-06 11:04:42 +03:00 |
| stream | build : do not use _GNU_SOURCE gratuitously (#1129) | 2023-09-07 12:36:14 +03:00 |
| stream.wasm | whisper : add integer quantization support (#540) | 2023-04-30 18:51:57 +03:00 |
| talk | build : do not use _GNU_SOURCE gratuitously (#1129) | 2023-09-07 12:36:14 +03:00 |
| talk-llama | whisper : Metal and ggml-alloc support (#1270) | 2023-09-15 12:18:18 +03:00 |
| talk.wasm | ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220) | 2023-09-05 13:54:40 +03:00 |
| whisper.android | whisper : Metal and ggml-alloc support (#1270) | 2023-09-15 12:18:18 +03:00 |
| whisper.nvim | examples : vim plugin and LSP server (#1144) | 2023-08-27 21:35:06 +03:00 |
| whisper.objc | whisper : Metal and ggml-alloc support (#1270) | 2023-09-15 12:18:18 +03:00 |
| whisper.swiftui | whisper : Metal and ggml-alloc support (#1270) | 2023-09-15 12:18:18 +03:00 |
| whisper.wasm | whisper : add memory sizes for Q8_0 (close #846) | 2023-05-01 10:03:56 +03:00 |
| CMakeLists.txt | examples : vim plugin and LSP server (#1144) | 2023-08-27 21:35:06 +03:00 |
| common-ggml.cpp | ggml : sync latest ggml lib | 2023-06-25 14:30:44 +03:00 |
| common-ggml.h | whisper : add integer quantization support (#540) | 2023-04-30 18:51:57 +03:00 |
| common-sdl.cpp | examples : refactor in order to reuse code and reduce duplication (#482) | 2023-02-15 19:28:10 +02:00 |
| common-sdl.h | examples : refactor in order to reuse code and reduce duplication (#482) | 2023-02-15 19:28:10 +02:00 |
| common.cpp | examples : fix build + compile warnings (close #1256) | 2023-09-07 12:33:12 +03:00 |
| common.h | ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220) | 2023-09-05 13:54:40 +03:00 |
| dr_wav.h | refactoring : move main + stream in examples + other stuff | 2022-10-25 20:53:48 +03:00 |
| generate-karaoke.sh | minor : add comment for using "generate_karaoke.sh" | 2022-11-26 10:22:42 +02:00 |
| helpers.js | whisper : add integer quantization support (#540) | 2023-04-30 18:51:57 +03:00 |
| livestream.sh | livestream.sh : run main with model arg instead of default (#453) | 2023-01-27 01:13:31 +02:00 |
| twitch.sh | twitch.sh : various fixes and polishing | 2022-12-08 19:20:04 +02:00 |
| yt-wsp.sh | yt-wsp.sh : print help on empty args | 2023-02-18 09:42:31 +02:00 |