whisper.cpp/examples at 741abb162ce8207eca2562e59c03efefb21c0122

ExternalVendorCode/whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-06-17 22:38:07 +00:00

denersc 741abb162c whisper : token-level timestamps with DTW (#1485)

* whisper.cpp: impl dtw algo

* WIP: producing and placing DTW timestamps on tokens

* Fix compile and assertion errors. Attempt to DTW timestamp with single_segment=false.

* Fix mistake causing incorrect alignment of dtw timestamps

* implement N_TOP_MOST and CUSTOM alignment heads setting

* whisper: fix typo on alignment heads enum

* Fix issues related to changes in whisper.cpp

* Fixed excessive memory use when using DTW timestamps. Other minor fixes to DTW timestamping function

* decoder: save cross QKs only if requested

* Calling median filter with ggml_map_custom1

* Reimpl aheads n_top_most and custom. Sanity checks on chosen aheads

* Copying cross QKs from decoder backend correctly

* dtw: cleanup

* Fix incorrect n_frames passed to dtw when near end of audio

* Fix aheads_masks_init for backend != CPU

* whisper : minor style

* main : add dtw (wip)

* whisper: fix invalid memory access in aheads_masks_init

* main : add dtw (cont)

* whisper : minor

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2024-03-20 18:25:26 +02:00

examples : clean up common code (#1871)

2024-02-19 10:50:15 +02:00

examples : fix typo in bench.cpp (#1933)

2024-03-06 22:21:44 +00:00

whisper : add support for large v3 (#1444)

2023-11-07 15:30:18 +02:00

examples : clean up common code (#1871)

2024-02-19 10:50:15 +02:00

whisper : add context param to disable gpu (#1293)

2023-11-06 11:04:24 +02:00

examples : clean up common code (#1871)

2024-02-19 10:50:15 +02:00

whisper : token-level timestamps with DTW (#1485)

2024-03-20 18:25:26 +02:00

examples : add python example for transcription (#1744)

2024-01-13 19:37:18 +02:00

quantize : fix load vocab crash when len is 128 (#1160)

2023-08-06 11:04:42 +03:00

examples : rename --audio-context to --audio-ctx per help text (#1953)

2024-03-18 17:53:33 +02:00

examples : clean up common code (#1871)

2024-02-19 10:50:15 +02:00

stream.wasm : fix invalid memory access when no segments (#1902)

2024-02-26 10:12:35 +02:00

whisper : add SYCL support (#1863)

2024-02-23 09:22:24 +02:00

talk, talk-llama : pass text_to_speak as a file (#1865)

2024-02-24 09:24:47 +02:00

talk-llama : sync llama.cpp

2024-03-15 14:21:59 +02:00

examples : clean up common code (#1871)

2024-02-19 10:50:15 +02:00

examples : initialize context params properly (#1852)

2024-02-11 16:39:12 +02:00

whisper.android

ggml : 32-bit arm compat (#1891)

2024-02-22 18:31:40 +02:00

whisper.android.java

whisper.android.java : fix returns in JNI (#1929)

2024-03-05 15:59:26 +02:00

examples : vim plugin and LSP server (#1144)

2023-08-27 21:35:06 +03:00

docs : make model options / model install methods clearer (#1806)

2024-01-26 17:39:54 +02:00

whisper.swiftui

whisper.swiftui : add .gitignore

2024-01-04 15:00:27 +02:00

whisper : add context param to disable gpu (#1293)

2023-11-06 11:04:24 +02:00

CMakeLists.txt

whisper : add SYCL support (#1863)

2024-02-23 09:22:24 +02:00

common-ggml.cpp

update examples and tests

2024-03-15 14:01:14 +02:00

common-ggml.h

whisper : add integer quantization support (#540)

2023-04-30 18:51:57 +03:00

common-sdl.cpp

sdl : fix audio callback (#1523)

2023-11-20 13:16:38 +02:00

common-sdl.h

sdl : fix audio callback (#1523)

2023-11-20 13:16:38 +02:00

common.cpp

talk, talk-llama : pass text_to_speak as a file (#1865)

2024-02-24 09:24:47 +02:00

common.h

talk, talk-llama : pass text_to_speak as a file (#1865)

2024-02-24 09:24:47 +02:00

dr_wav.h

refactoring : move main + stream in examples + other stuff

2022-10-25 20:53:48 +03:00

generate-karaoke.sh

minor : add comment for using "generate_karaoke.sh"

2022-11-26 10:22:42 +02:00

grammar-parser.cpp

whisper : add grammar-based sampling (#1229)

2023-11-13 10:51:34 +02:00

grammar-parser.h

whisper : add grammar-based sampling (#1229)

2023-11-13 10:51:34 +02:00

helpers.js

wchess : whisper assisted chess (#1595)

2023-12-14 15:58:26 +02:00

json.hpp

examples : clean up common code (#1871)

2024-02-19 10:50:15 +02:00

livestream.sh

whisper : make large version explicit + fix data size units (#1493)

2023-11-15 19:42:25 +02:00

twitch.sh

whisper : make large version explicit + fix data size units (#1493)

2023-11-15 19:42:25 +02:00

yt-wsp.sh

yt-wsp.sh : print help on empty args

2023-02-18 09:42:31 +02:00