boolemancer
0bfe728b84
Fix the Windows pthread_create shim
...
The current implementation doesn't actually set the out parameter,
and it returns 0 on failure instead of on success.
2022-11-08 15:02:32 +02:00
Georgi Gerganov
4e5674a5d5
sync : submodule whisper.spm
2022-11-07 21:48:13 +02:00
Georgi Gerganov
4c66b6a828
cmake : add submodule whisper.spm
2022-11-07 20:50:24 +02:00
Georgi Gerganov
c30bffc8a5
ref #22 : add "duration" option
...
Can be used to partially process a recording
2022-11-07 20:14:52 +02:00
Georgi Gerganov
8fdfb0ba92
Update README.md
2022-11-06 21:04:21 +02:00
Georgi Gerganov
c71363f14c
examples : add simple script for generating Karaoke video
2022-11-06 09:22:50 +02:00
Georgi Gerganov
a09e9123ca
Update README.md
2022-11-05 08:44:41 +02:00
Georgi Gerganov
d42cf6d0df
Update README.md
2022-11-04 22:26:08 +02:00
Georgi Gerganov
ef47d77492
main : fix generated bash script
2022-11-04 18:30:38 +02:00
Georgi Gerganov
75171c2b79
ggml : multi-thread the ggml_add operator
2022-11-03 20:53:44 +02:00
Georgi Gerganov
a2eeb941f6
cmake : fix passing GGML_PERF compile option
2022-11-03 20:19:06 +02:00
Georgi Gerganov
0e689f83d8
Update README.md
2022-11-02 22:03:27 +02:00
Georgi Gerganov
d5afebd37c
whisper : token-level timestamp refactoring ( #49 , #120 )
...
This turned out pretty good overall. The algorithm has been moved from
main.cpp to whisper.cpp and can be reused for all subtitles types. This
means that now you can specify the maximum length of the generated
lines. Simply provide the "-ml" argument specifying the max length in
number of characters
2022-11-02 21:45:54 +02:00
Georgi Gerganov
4b1c32e8ea
Update README.md
2022-11-02 18:33:29 +02:00
Georgi Gerganov
b5dde365e9
extra : compute SHA of all models files
2022-11-02 18:31:55 +02:00
Georgi Gerganov
02dfd5b8c3
whisper : fix extra memory usage after recent processor changes
...
Had increased the memory buffer to the size of the model and forgot to
bring it down.
2022-11-02 18:31:18 +02:00
Syed Jafri
c63ce24834
Allow building with Accelerate for x86_64 Macs ( #123 )
...
* Cross compile windows
* set env properly
* rm log
* fix review
* Add back space
* Don't force architecture
* Allow building x86_64 with accelerate
2022-11-02 18:00:19 +02:00
Georgi Gerganov
137321915f
ggml : fix the check for NEON support ( #7 )
...
Was using the wrong preprocessor macro
2022-11-02 17:52:24 +02:00
Syed Jafri
24cd12f647
Cross compilation ( #121 )
...
* Cross compile windows
* set env properly
* rm log
* fix review
* Add back space
2022-11-02 08:46:49 +02:00
Georgi Gerganov
e46bc56e71
Update README.md
2022-11-01 22:47:58 +02:00
Georgi Gerganov
6fb98370ba
main : add some comments for the word-level timestamp algorithm
2022-11-01 22:35:21 +02:00
Georgi Gerganov
0729da9a3b
main : fix some edge cases for word-level timestamps
2022-11-01 22:09:25 +02:00
Georgi Gerganov
5dc74e3aff
Update README.md
2022-10-31 22:06:05 +02:00
Georgi Gerganov
ac8ef34039
Update README.md
2022-10-31 20:19:41 +02:00
Mikhail Grigorev
b26345cc7b
Added for Windows implemenated script download-ggml-model.cmd
2022-10-31 19:38:20 +02:00
Mikhail Grigorev
8dac3c6e10
Fixed sched_yield
2022-10-30 21:38:18 +02:00
Mikhail Grigorev
6417e59aad
Implemenated sched_yield function for Windows
2022-10-30 21:38:18 +02:00
Georgi Gerganov
dc12994603
Update README.md
2022-10-30 17:11:37 +02:00
Georgi Gerganov
b0f2aa0ea6
Update README.md
2022-10-30 17:10:46 +02:00
Georgi Gerganov
57fb46f307
main : add option for word-leve timestamps (very experimental)
2022-10-30 17:06:57 +02:00
Georgi Gerganov
5a9e4260a6
stream : add "--capture" option to select capture device (ref #10 )
2022-10-30 08:27:04 +02:00
Georgi Gerganov
eba62e0fa1
close #113 : fix struct whisper_token_data
2022-10-30 08:23:52 +02:00
Georgi Gerganov
69bdb6624a
minor : update whisper.js
2022-10-29 21:28:21 +03:00
Georgi Gerganov
12fb303d9d
whisper.wasm : update system info print
2022-10-29 20:32:41 +03:00
Georgi Gerganov
234f414652
ref #5 : update CMake for Windows build
...
- __AVX2__ should already be defined due to /arch:AVX2
- _CRT_SECURE_NO_WARNINGS should be defined both for shared and static lib
2022-10-29 19:41:50 +03:00
Georgi Gerganov
014a119052
minor : fix multiple definitions of to_timestamp()
2022-10-29 19:37:19 +03:00
Georgi Gerganov
dec40be58f
parallel : print time of audio boundaries + fix timings
2022-10-29 19:37:19 +03:00
Georgi Gerganov
e5044f87d9
ggml : fix barrier
2022-10-29 19:37:19 +03:00
Georgi Gerganov
2827cbbbe8
main : merge parallel example in main
2022-10-29 19:37:19 +03:00
Georgi Gerganov
0b2dc3c82c
parallel : working
2022-10-29 19:37:19 +03:00
Georgi Gerganov
a272f10b2e
ggml : fix thread-safety of ggml_init and ggml_free
2022-10-29 19:37:19 +03:00
Georgi Gerganov
85d6e1e1e7
main : fix sampling time + add max_context parameter
2022-10-29 19:37:19 +03:00
Georgi Gerganov
72e9cdd6bf
parallel : adding tool for parallel transformer inference
2022-10-29 19:37:19 +03:00
Borislav Stanimirov
c565c569e7
Define WHISPER_BUILD so as to export symbols on Windows
2022-10-29 13:23:09 +03:00
Georgi Gerganov
2c281d190b
Update README.md
2022-10-28 22:09:40 +03:00
Georgi Gerganov
b89f8960ca
Update README.md
2022-10-28 21:40:52 +03:00
Georgi Gerganov
6f82320b05
Create README.md
2022-10-28 20:25:37 +03:00
Georgi Gerganov
2298310dd8
whisper.nvim : add helper script for the Neovim integration
2022-10-28 20:25:37 +03:00
Georgi Gerganov
8347a7bb6a
stream : few updates to make it compatible for Vim usage ( #99 )
2022-10-27 22:10:50 +03:00
Georgi Gerganov
fbd513b813
Add OpenBLAS support
...
Supported via CMake - just add:
cmake .. -DWHISPER_SUPPORT_OPENBLAS=ON
On Ubuntu, you have to install the library like this:
apt install libopenblas-dev
Unfortunately, I don't observe any benefit compared to the
original AVX2 + FP16 implementation. Maybe I'm missing something
2022-10-27 18:31:49 +03:00