Commit Graph

989 Commits

Author SHA1 Message Date
c30bffc8a5 ref #22 : add "duration" option
Can be used to partially process a recording
2022-11-07 20:14:52 +02:00
8fdfb0ba92 Update README.md 2022-11-06 21:04:21 +02:00
c71363f14c examples : add simple script for generating Karaoke video 2022-11-06 09:22:50 +02:00
a09e9123ca Update README.md 2022-11-05 08:44:41 +02:00
d42cf6d0df Update README.md 2022-11-04 22:26:08 +02:00
ef47d77492 main : fix generated bash script 2022-11-04 18:30:38 +02:00
75171c2b79 ggml : multi-thread the ggml_add operator 2022-11-03 20:53:44 +02:00
a2eeb941f6 cmake : fix passing GGML_PERF compile option 2022-11-03 20:19:06 +02:00
0e689f83d8 Update README.md 2022-11-02 22:03:27 +02:00
d5afebd37c whisper : token-level timestamp refactoring (#49, #120)
This turned out pretty good overall. The algorithm has been moved from
main.cpp to whisper.cpp and can be reused for all subtitles types. This
means that now you can specify the maximum length of the generated
lines. Simply provide the "-ml" argument specifying the max length in
number of characters
2022-11-02 21:45:54 +02:00
4b1c32e8ea Update README.md 2022-11-02 18:33:29 +02:00
b5dde365e9 extra : compute SHA of all models files 2022-11-02 18:31:55 +02:00
02dfd5b8c3 whisper : fix extra memory usage after recent processor changes
Had increased the memory buffer to the size of the model and forgot to
bring it down.
2022-11-02 18:31:18 +02:00
c63ce24834 Allow building with Accelerate for x86_64 Macs (#123)
* Cross compile windows

* set env properly

* rm log

* fix review

* Add back space

* Don't force architecture

* Allow building x86_64 with accelerate
2022-11-02 18:00:19 +02:00
137321915f ggml : fix the check for NEON support (#7)
Was using the wrong preprocessor macro
2022-11-02 17:52:24 +02:00
24cd12f647 Cross compilation (#121)
* Cross compile windows

* set env properly

* rm log

* fix review

* Add back space
2022-11-02 08:46:49 +02:00
e46bc56e71 Update README.md 2022-11-01 22:47:58 +02:00
6fb98370ba main : add some comments for the word-level timestamp algorithm 2022-11-01 22:35:21 +02:00
0729da9a3b main : fix some edge cases for word-level timestamps 2022-11-01 22:09:25 +02:00
5dc74e3aff Update README.md 2022-10-31 22:06:05 +02:00
ac8ef34039 Update README.md 2022-10-31 20:19:41 +02:00
b26345cc7b Added for Windows implemenated script download-ggml-model.cmd 2022-10-31 19:38:20 +02:00
8dac3c6e10 Fixed sched_yield 2022-10-30 21:38:18 +02:00
6417e59aad Implemenated sched_yield function for Windows 2022-10-30 21:38:18 +02:00
dc12994603 Update README.md 2022-10-30 17:11:37 +02:00
b0f2aa0ea6 Update README.md 2022-10-30 17:10:46 +02:00
57fb46f307 main : add option for word-leve timestamps (very experimental) 2022-10-30 17:06:57 +02:00
5a9e4260a6 stream : add "--capture" option to select capture device (ref #10) 2022-10-30 08:27:04 +02:00
eba62e0fa1 close #113 : fix struct whisper_token_data 2022-10-30 08:23:52 +02:00
69bdb6624a minor : update whisper.js 2022-10-29 21:28:21 +03:00
12fb303d9d whisper.wasm : update system info print 2022-10-29 20:32:41 +03:00
234f414652 ref #5 : update CMake for Windows build
- __AVX2__ should already be defined due to /arch:AVX2
- _CRT_SECURE_NO_WARNINGS should be defined both for shared and static lib
2022-10-29 19:41:50 +03:00
014a119052 minor : fix multiple definitions of to_timestamp() 2022-10-29 19:37:19 +03:00
dec40be58f parallel : print time of audio boundaries + fix timings 2022-10-29 19:37:19 +03:00
e5044f87d9 ggml : fix barrier 2022-10-29 19:37:19 +03:00
2827cbbbe8 main : merge parallel example in main 2022-10-29 19:37:19 +03:00
0b2dc3c82c parallel : working 2022-10-29 19:37:19 +03:00
a272f10b2e ggml : fix thread-safety of ggml_init and ggml_free 2022-10-29 19:37:19 +03:00
85d6e1e1e7 main : fix sampling time + add max_context parameter 2022-10-29 19:37:19 +03:00
72e9cdd6bf parallel : adding tool for parallel transformer inference 2022-10-29 19:37:19 +03:00
c565c569e7 Define WHISPER_BUILD so as to export symbols on Windows 2022-10-29 13:23:09 +03:00
2c281d190b Update README.md 2022-10-28 22:09:40 +03:00
b89f8960ca Update README.md 2022-10-28 21:40:52 +03:00
6f82320b05 Create README.md 2022-10-28 20:25:37 +03:00
2298310dd8 whisper.nvim : add helper script for the Neovim integration 2022-10-28 20:25:37 +03:00
8347a7bb6a stream : few updates to make it compatible for Vim usage (#99) 2022-10-27 22:10:50 +03:00
fbd513b813 Add OpenBLAS support
Supported via CMake - just add:

cmake .. -DWHISPER_SUPPORT_OPENBLAS=ON

On Ubuntu, you have to install the library like this:

apt install libopenblas-dev

Unfortunately, I don't observe any benefit compared to the
original AVX2 + FP16 implementation. Maybe I'm missing something
2022-10-27 18:31:49 +03:00
ebb01b9e33 Print system info at start of program 2022-10-27 17:22:19 +03:00
9820234f13 Fixed compile definitions and link libraries for MSVC 2022-10-27 17:20:49 +03:00
a22e5741d8 Add helper script to benchmark all models
Simply run:

$ ./extra/bench-all.sh
2022-10-26 23:25:38 +03:00