Commit Graph

252 Commits

Author SHA1 Message Date
Georgi Gerganov
a272f10b2e ggml : fix thread-safety of ggml_init and ggml_free 2022-10-29 19:37:19 +03:00
Georgi Gerganov
85d6e1e1e7 main : fix sampling time + add max_context parameter 2022-10-29 19:37:19 +03:00
Georgi Gerganov
72e9cdd6bf parallel : adding tool for parallel transformer inference 2022-10-29 19:37:19 +03:00
Borislav Stanimirov
c565c569e7 Define WHISPER_BUILD so as to export symbols on Windows 2022-10-29 13:23:09 +03:00
Georgi Gerganov
2c281d190b
Update README.md 2022-10-28 22:09:40 +03:00
Georgi Gerganov
b89f8960ca
Update README.md 2022-10-28 21:40:52 +03:00
Georgi Gerganov
6f82320b05 Create README.md 2022-10-28 20:25:37 +03:00
Georgi Gerganov
2298310dd8 whisper.nvim : add helper script for the Neovim integration 2022-10-28 20:25:37 +03:00
Georgi Gerganov
8347a7bb6a
stream : few updates to make it compatible for Vim usage (#99) 2022-10-27 22:10:50 +03:00
Georgi Gerganov
fbd513b813 Add OpenBLAS support
Supported via CMake - just add:

cmake .. -DWHISPER_SUPPORT_OPENBLAS=ON

On Ubuntu, you have to install the library like this:

apt install libopenblas-dev

Unfortunately, I don't observe any benefit compared to the
original AVX2 + FP16 implementation. Maybe I'm missing something
2022-10-27 18:31:49 +03:00
Georgi Gerganov
ebb01b9e33
Print system info at start of program 2022-10-27 17:22:19 +03:00
Mikhail Grigorev
9820234f13 Fixed compile definitions and link libraries for MSVC 2022-10-27 17:20:49 +03:00
Georgi Gerganov
a22e5741d8
Add helper script to benchmark all models
Simply run:

$ ./extra/bench-all.sh
2022-10-26 23:25:38 +03:00
Georgi Gerganov
2400660f3f Print system info in main 2022-10-26 22:54:09 +03:00
Georgi Gerganov
058a27b2e5
Create README.md 2022-10-26 18:14:10 +03:00
andypayne
a09ce6e889 Changes to work by default on macOS - use curl when wget is not available, and use an alternative method to get the script path when realpath is not available. 2022-10-26 12:18:18 +03:00
Georgi Gerganov
a6c786d5dc Update README.md 2022-10-25 20:53:48 +03:00
Georgi Gerganov
9ccafa8792 Update README.md 2022-10-25 20:53:48 +03:00
Georgi Gerganov
89d8ee3ee5 Update README.md 2022-10-25 20:53:48 +03:00
Georgi Gerganov
91dcf5f35b Update README.md 2022-10-25 20:53:48 +03:00
Georgi Gerganov
113a4f06d8 Update README.md 2022-10-25 20:53:48 +03:00
Georgi Gerganov
47e78b7288 Update README.md 2022-10-25 20:53:48 +03:00
Georgi Gerganov
34bb3ab0cf ggml : add system info functions 2022-10-25 20:53:48 +03:00
Georgi Gerganov
c6710efde2 refactoring : move main + stream in examples + other stuff 2022-10-25 20:53:48 +03:00
Georgi Gerganov
4c68f4cac0
main : fix SRT timestamp to use comma "," instead of dot "." 2022-10-24 18:28:23 +03:00
Georgi Gerganov
728676927f
Update README.md 2022-10-24 18:26:21 +03:00
Georgi Gerganov
d4f94ce427 Update README.md 2022-10-24 18:23:07 +03:00
Georgi Gerganov
a52ee08c1e objc : polishing the sample application 2022-10-24 18:23:07 +03:00
Georgi Gerganov
b41f4a90eb Create README.md 2022-10-24 18:23:07 +03:00
Georgi Gerganov
bb1ee266d2 ios : whisper.objc example 2022-10-24 18:23:07 +03:00
Georgi Gerganov
5f7e9fa2dc
ref #68, #79 : fix segment time output 2022-10-23 13:30:30 +03:00
Georgi Gerganov
181b762de8
Update README.md 2022-10-23 12:47:51 +03:00
Georgi Gerganov
3d37ad5133
Merge pull request #78 from jokkebk/Specify-utf8-for-vocab.json
Add enconding parameter to vocab.json opening to fix errors
2022-10-23 12:23:04 +03:00
Joonas Pihlajamaa
4e887dc350 Add enconding parameter to vocab.json opening to fix errors 2022-10-23 11:55:01 +03:00
Georgi Gerganov
4196856c7b
Update README.md 2022-10-23 10:24:36 +03:00
Georgi Gerganov
705198f063
Update README.md 2022-10-23 10:12:10 +03:00
Georgi Gerganov
3e69a6071d
Update README.md 2022-10-23 08:04:33 +03:00
Georgi Gerganov
f3dae90c31 Update README.md 2022-10-22 21:17:21 +03:00
Georgi Gerganov
6a81ed3e78 main : print colors + no timestamps 2022-10-22 21:17:21 +03:00
Georgi Gerganov
7affd309d3 whisper : add new-segment callback
Can be used to process new segments as they are being generated.
Sample usage in main, for printing the resulting segments during the
inference.
2022-10-22 21:17:21 +03:00
Georgi Gerganov
8f95c25aed main : refactor subtitle output 2022-10-22 21:17:21 +03:00
Georgi Gerganov
31ff0c6a1f wip : experimental color coding of tokens based on probabilities 2022-10-22 21:17:21 +03:00
Georgi Gerganov
f4aa01c2f8
Update README.md 2022-10-22 19:30:35 +03:00
Georgi Gerganov
8c1d970088
Update README.md 2022-10-22 19:00:25 +03:00
Georgi Gerganov
6b45e37b2b Update README.md and finalize the whisper.wasm example 2022-10-22 18:54:01 +03:00
Georgi Gerganov
491ecd7056 wip : polishing WASM example 2022-10-22 18:54:01 +03:00
Georgi Gerganov
db460b78ff wip : WASM 128-bit SIMD support 2022-10-22 18:54:01 +03:00
Georgi Gerganov
e905c6f827 wip : initial WASM port
Works but it is very slow because no SIMD is used.
For example, jfk.wav is processed in ~23 seconds using "tiny.en" model
2022-10-22 18:54:01 +03:00
Georgi Gerganov
7d0dee7a8a
ref #68 : add option "-on" to specify segment index offset for SRT
Also, change option "-o" to "-ot"
2022-10-21 18:14:53 +03:00
Georgi Gerganov
8d15a1c635
ci : fix and re-enable tests (2nd try) 2022-10-21 15:57:20 +03:00