Commit Graph

147 Commits

Author SHA1 Message Date
a09e9123ca Update README.md 2022-11-05 08:44:41 +02:00
0e689f83d8 Update README.md 2022-11-02 22:03:27 +02:00
d5afebd37c whisper : token-level timestamp refactoring (#49, #120)
This turned out pretty good overall. The algorithm has been moved from
main.cpp to whisper.cpp and can be reused for all subtitle types. This
means that you can now specify the maximum length of the generated
lines. Simply provide the "-ml" argument specifying the max length in
number of characters.
2022-11-02 21:45:54 +02:00
4b1c32e8ea Update README.md 2022-11-02 18:33:29 +02:00
b5dde365e9 extra : compute SHA of all model files 2022-11-02 18:31:55 +02:00
e46bc56e71 Update README.md 2022-11-01 22:47:58 +02:00
b0f2aa0ea6 Update README.md 2022-10-30 17:10:46 +02:00
2c281d190b Update README.md 2022-10-28 22:09:40 +03:00
9ccafa8792 Update README.md 2022-10-25 20:53:48 +03:00
89d8ee3ee5 Update README.md 2022-10-25 20:53:48 +03:00
c6710efde2 refactoring : move main + stream in examples + other stuff 2022-10-25 20:53:48 +03:00
728676927f Update README.md 2022-10-24 18:26:21 +03:00
181b762de8 Update README.md 2022-10-23 12:47:51 +03:00
4196856c7b Update README.md 2022-10-23 10:24:36 +03:00
705198f063 Update README.md 2022-10-23 10:12:10 +03:00
3e69a6071d Update README.md 2022-10-23 08:04:33 +03:00
f3dae90c31 Update README.md 2022-10-22 21:17:21 +03:00
8c1d970088 Update README.md 2022-10-22 19:00:25 +03:00
6b45e37b2b Update README.md and finalize the whisper.wasm example 2022-10-22 18:54:01 +03:00
5698b51718 Update README.md 2022-10-20 17:52:59 +03:00
3fe3898ebb Update README.md 2022-10-20 17:43:56 +03:00
81c185576c Update README.md 2022-10-20 17:39:31 +03:00
1969ee4bc7 Update README.md 2022-10-18 22:20:35 +03:00
72d967bce4 Use Accelerate framework on Apple silicon
Huge performance improvement in the Encode step (almost 2x on MacBook M1 Pro)

Also various extra optimizations:

- Multi-threaded NORM operator
- Faster GELU via F16 cast
2022-10-18 00:12:51 +03:00
36945162fa Update README.md (ref #50) 2022-10-15 09:40:08 +03:00
b2f1600aa3 Update README.md 2022-10-12 21:25:42 +03:00
1348796a93 Update README.md (#43)
* Update README.md

Updated README.md to list new features, such as subtitle file support (VTT and SRT)

* Update README.md

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2022-10-12 07:32:14 +03:00
8d94358251 Update README.md 2022-10-11 00:36:32 +03:00
ad6693fb64 Update README.md 2022-10-10 22:16:25 +03:00
63b6786767 Minor 2022-10-10 22:06:27 +03:00
f7ab81fe51 Update README.md 2022-10-10 22:05:37 +03:00
4c4ab71d4d Update README.md 2022-10-08 11:46:34 +03:00
2d47693435 Update README.md 2022-10-08 11:43:42 +03:00
700898e6ed ref #22 : add option to provide multiple input .wav files 2022-10-05 23:44:10 +03:00
6b1c3cc198 Update README.md 2022-10-05 23:13:15 +03:00
b8f713482e Minor updates 2022-10-05 23:11:02 +03:00
e7a15876f8 Update README.md 2022-10-04 23:27:25 +03:00
d71e567656 Update README.md 2022-10-02 18:19:22 +03:00
62897e8ae6 Update README.md 2022-10-01 00:01:04 +03:00
3bcdbdfc32 Reduce memory usage even more + better sampling
- The encode/decode memory buffers are now reused
- If the 30-sec segment goes on for too long without a timestamp token, we
  force one. This improves transcription for the large model
- Stereo support
- Add "micro-machines.wav" sample
2022-09-30 19:35:27 +03:00
310f4883d1 Update README.md 2022-09-29 23:48:01 +03:00
fd3f3d748f Update README.md 2022-09-29 23:37:59 +03:00
5877c3578e ref #4 : added transcription timestamps
Can be turned off with "-nt" argument.
Performance has also improved.
2022-09-29 23:09:39 +03:00
4352a6018b Update README.md 2022-09-28 21:13:32 +03:00
f888c2373d Flash + language support (ref #2)
- Achieved big performance improvement + memory usage reduction
- Can now translate / transcribe different languages
2022-09-28 21:07:32 +03:00
476182e439 Update README.md and simplify usage 2022-09-26 09:36:51 +03:00
f2456f8d93 Create README.md 2022-09-25 22:59:04 +03:00