whisper.cpp/examples at 1d79e78402abaa9d92e1f55bb15316443b29ea9f - whisper.cpp - Gitea

ExternalVendorCode/whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-06-11 03:31:36 +00:00

History

Georgi Gerganov b6c5f49b78

whisper : add batched decoding (#1486 )

* whisper : add whisper_batch

* whisper : move kv_self to whisper_state

* whisper : full batched decoding support

* whisper : fix memory leak in whisper_batch

* whisper : fix mem leak again + remove oboslete function

* whisper : clear kv cache when using whisper_decode API

* whisper : speed-up sampling

* whisper : fix decoders initializer

* bench : add batch size 5 bench

* whisper : add comment about the KV cache size

* whisper : add check for max number of decoders

* whisper : avoid starting sampling threads with bs=1

* whisper : enable beam-search by default

* cuda : sync llama.cpp fixes

2023-11-15 16:12:52 +02:00

..

whisper : add context param to disable gpu (#1293 )

2023-11-06 11:04:24 +02:00

whisper : add batched decoding (#1486 )

2023-11-15 16:12:52 +02:00

whisper : add support for large v3 (#1444 )

2023-11-07 15:30:18 +02:00

whisper : add grammar-based sampling (#1229 )

2023-11-13 10:51:34 +02:00

whisper : add context param to disable gpu (#1293 )

2023-11-06 11:04:24 +02:00

whisper : add context param to disable gpu (#1293 )

2023-11-06 11:04:24 +02:00

whisper : add batched decoding (#1486 )

2023-11-15 16:12:52 +02:00

quantize : fix load vocab crash when len is 128 (#1160 )

2023-08-06 11:04:42 +03:00

whisper : add context param to disable gpu (#1293 )

2023-11-06 11:04:24 +02:00

whisper : add context param to disable gpu (#1293 )

2023-11-06 11:04:24 +02:00

whisper : add full CUDA and Metal offloading (#1472 )

2023-11-12 15:31:08 +02:00

talk-llama : add n_gpu_layers parameter (#1475 )

2023-11-13 10:04:16 +02:00

whisper : add context param to disable gpu (#1293 )

2023-11-06 11:04:24 +02:00

whisper.android

examples : add whisper.android.java for compatibility with older Android versions using Java (#1382 )

2023-11-12 18:31:58 +02:00

whisper.android.java

examples : add whisper.android.java for compatibility with older Android versions using Java (#1382 )

2023-11-12 18:31:58 +02:00

examples : vim plugin and LSP server (#1144 )

2023-08-27 21:35:06 +03:00

whisper : add context param to disable gpu (#1293 )

2023-11-06 11:04:24 +02:00

whisper.swiftui

ios : add support for Swift Package Manager (#1370 )

2023-11-07 23:53:31 +02:00

whisper : add context param to disable gpu (#1293 )

2023-11-06 11:04:24 +02:00

CMakeLists.txt

whisper : add grammar-based sampling (#1229 )

2023-11-13 10:51:34 +02:00

common-ggml.cpp

ggml : sync latest ggml lib

2023-06-25 14:30:44 +03:00

common-ggml.h

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

common-sdl.cpp

examples : refactor in order to reuse code and reduce duplication (#482 )

2023-02-15 19:28:10 +02:00

common-sdl.h

examples : refactor in order to reuse code and reduce duplication (#482 )

2023-02-15 19:28:10 +02:00

common.cpp

sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422 )

2023-11-03 21:35:05 +02:00

common.h

whisper : add full CUDA and Metal offloading (#1472 )

2023-11-12 15:31:08 +02:00

dr_wav.h

refactoring : move main + stream in examples + other stuff

2022-10-25 20:53:48 +03:00

generate-karaoke.sh

minor : add comment for using "generate_karaoke.sh"

2022-11-26 10:22:42 +02:00

grammar-parser.cpp

whisper : add grammar-based sampling (#1229 )

2023-11-13 10:51:34 +02:00

grammar-parser.h

whisper : add grammar-based sampling (#1229 )

2023-11-13 10:51:34 +02:00

helpers.js

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

livestream.sh

whisper : add support for large v3 (#1444 )

2023-11-07 15:30:18 +02:00

twitch.sh

whisper : add support for large v3 (#1444 )

2023-11-07 15:30:18 +02:00

yt-wsp.sh

yt-wsp.sh : print help on empty args

2023-02-18 09:42:31 +02:00