whisper.cpp/examples at c8d0f5fe9801862bdd7f63a949937a804d02cfb5 - whisper.cpp - Gitea

ExternalVendorCode/whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-06-20 23:55:04 +00:00

Files

History

Akash Mahajan c8d0f5fe98 whisper : support speaker segmentation (local diarization) of mono audio via tinydiarize (#1058 )

* add HuggingFace mirror to download  ggml model

* support tdrz via simple hack overriding solm tokens

* fix incorrect translate/transcribe token_ids that are not static const

* add apollo 13 sample for tdrz demo

* render [SPEAKER TURN] consistently in all terminal output using vocab.id_to_token

* extend whisper_segment with speaker_turn_next field and save in json output

* fix failing go build

* slipped in some python syntax whoops

* whisper : finalize tinydiarize support (add flag + fixes)

* whisper : tdrz support for word-level timestamps (respect max_len)

* java : try to fix tests after adding tdrz_enable flag

* main : remove TODO leftover

* java : fix params order list after adding "tdrz_enable"

* whisper : fix solm and add nosp token

* main : print tinydiarize help

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2023-07-04 09:45:00 +03:00

..

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

bench : fix Windows linkage by moving ggml benches in whisper lib ..

2023-01-18 21:19:50 +02:00

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027 )"

2023-07-02 21:53:52 +03:00

examples : fix + refactor Levenshtein distance

2023-04-30 19:12:49 +03:00

whisper : support speaker segmentation (local diarization) of mono audio via tinydiarize (#1058 )

2023-07-04 09:45:00 +03:00

ggml : sync latest repo (mostly refactoring changes)

2023-07-02 21:46:09 +03:00

Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027 )"

2023-07-02 21:53:52 +03:00

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027 )"

2023-07-02 21:53:52 +03:00

talk-llama : fix new rope interface

2023-07-03 19:24:01 +03:00

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

whisper.android

whisper.android : support decode wav file has 2 channels (#972 )

2023-05-31 10:13:14 +03:00

models : cd statements are quoted to allow spaces in path (#1041 )

2023-06-25 15:27:28 +03:00

whisper.objc : enable Core ML in example & fix segmentation fault (#910 )

2023-05-14 09:47:02 +03:00

whisper.swiftui

whisper.swiftui : update README.md (#682 )

2023-03-29 23:04:38 +03:00

whisper : add memory sizes for Q8_0 (close #846 )

2023-05-01 10:03:56 +03:00

CMakeLists.txt

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

common-ggml.cpp

ggml : sync latest ggml lib

2023-06-25 14:30:44 +03:00

common-ggml.h

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

common-sdl.cpp

examples : refactor in order to reuse code and reduce duplication (#482 )

2023-02-15 19:28:10 +02:00

common-sdl.h

examples : refactor in order to reuse code and reduce duplication (#482 )

2023-02-15 19:28:10 +02:00

common.cpp

ggml : sync latest repo (mostly refactoring changes)

2023-07-02 21:46:09 +03:00

common.h

ggml : sync latest repo (mostly refactoring changes)

2023-07-02 21:46:09 +03:00

dr_wav.h

refactoring : move main + stream in examples + other stuff

2022-10-25 20:53:48 +03:00

generate-karaoke.sh

minor : add comment for using "generate_karaoke.sh"

2022-11-26 10:22:42 +02:00

helpers.js

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

livestream.sh

livestream.sh : run main with model arg instead of default (#453 )

2023-01-27 01:13:31 +02:00

twitch.sh

twitch.sh : various fixes and polishing

2022-12-08 19:20:04 +02:00

yt-wsp.sh

yt-wsp.sh : print help on empty args

2023-02-18 09:42:31 +02:00