Commit Graph

775 Commits

Author SHA1 Message Date
f007e186fe Fix: main get language from cli args 2022-10-05 09:24:53 +07:00
e7a15876f8 Update README.md 2022-10-04 23:27:25 +03:00
6814cc9b02 Improve result printing 2022-10-04 23:18:15 +03:00
eba33adadd Extend C-style API with full inference methods 2022-10-04 23:18:15 +03:00
6b77124e01 Initial C-style interface for whisper.cpp 2022-10-04 23:18:15 +03:00
be8ba034f6 ref #10 : handle Ctrl+C in "stream" app 2022-10-02 20:11:17 +03:00
d71e567656 Update README.md 2022-10-02 18:19:22 +03:00
b6bf906730 ref #10 : quick-and-dirty attempt for real-time audio transcription
- Processes input in chunks of 3 seconds.
- Padding audio with silence
- Uses 1 second audio from previous pass
- No text context
2022-10-02 17:55:45 +03:00
77d929f603 Fix bug in FFT
The FFT routine does not work for odd N
Solution is to add DFT and use it when N is odd
2022-10-02 17:46:21 +03:00
6d654d192a Fix reading of stereo WAV files 2022-10-01 08:41:57 +03:00
62897e8ae6 Update README.md 2022-10-01 00:01:04 +03:00
15b49e8baf Bug fix
Longer prompts could cause out-of-bounds access
2022-09-30 20:37:29 +03:00
3bcdbdfc32 Reduce memory usage even more + better sampling
- The encode/decode memory buffers are now reused
- If the 30-sec segment goes for too long without a timestamp token, we
  force one. Improves transcription for large model
- Stereo support
- Add "micro-machines.wav" sample
2022-09-30 19:35:27 +03:00
310f4883d1 Update README.md 2022-09-29 23:48:01 +03:00
fd3f3d748f Update README.md 2022-09-29 23:37:59 +03:00
5877c3578e ref #4 : added transcription timestamps
Can be turned off with "-nt" argument.
Performance has also improved.
2022-09-29 23:09:39 +03:00
8d4041c31f Merge pull request #3 from cdosoftei/master
Pass -pthread to linker
2022-09-28 22:06:09 +03:00
d4fcfa47b0 Pass -pthread to linker 2022-09-28 15:01:54 -04:00
4352a6018b Update README.md 2022-09-28 21:13:32 +03:00
f888c2373d Flash + language support (ref #2)
- Achieved big performance improvement + memory usage reduction
- Can now translate / transcribe different languages
2022-09-28 21:07:32 +03:00
154fa796dd ref #1 : add -pthread to compilation flags 2022-09-26 11:58:44 +03:00
476182e439 Update README.md and simplify usage 2022-09-26 09:36:51 +03:00
f2456f8d93 Create README.md 2022-09-25 22:59:04 +03:00
28802c4dae Create LICENSE 2022-09-25 22:15:44 +03:00
b0a11594ae Initial release 2022-09-25 22:13:49 +03:00
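
Commit 77d929f603 above states that the FFT routine failed for odd N and that the fix was to add a DFT and use it when N is odd. The following is a minimal sketch of that idea, not the code from the commit: the function names `dft` and `fft`, the `double` precision, and the interleaved real/imaginary output layout are all assumptions made for illustration. The recursion halves the input while the length is even and hands any odd-length sub-problem to the naive DFT.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Naive O(N^2) DFT of a real signal (hypothetical helper).
// Output is interleaved: out[2*k] is the real part, out[2*k + 1] the imaginary part.
std::vector<double> dft(const std::vector<double>& in) {
    const std::size_t n = in.size();
    const double pi = std::acos(-1.0);
    std::vector<double> out(2 * n, 0.0);
    for (std::size_t k = 0; k < n; ++k) {
        double re = 0.0;
        double im = 0.0;
        for (std::size_t t = 0; t < n; ++t) {
            const double angle = 2.0 * pi * double(k) * double(t) / double(n);
            re += in[t] * std::cos(angle);
            im -= in[t] * std::sin(angle);
        }
        out[2 * k]     = re;
        out[2 * k + 1] = im;
    }
    return out;
}

// Recursive Cooley-Tukey FFT of a real signal (hypothetical helper).
// The even/odd split only works when N is even, so odd N falls back to dft().
std::vector<double> fft(const std::vector<double>& in) {
    const std::size_t n = in.size();
    if (n == 0) return {};
    if (n == 1) return {in[0], 0.0};
    if (n % 2 == 1) return dft(in);  // odd length: cannot be halved

    // Split into even-indexed and odd-indexed samples.
    std::vector<double> even, odd;
    even.reserve(n / 2);
    odd.reserve(n / 2);
    for (std::size_t i = 0; i < n; ++i) {
        (i % 2 == 0 ? even : odd).push_back(in[i]);
    }

    const std::vector<double> e = fft(even);
    const std::vector<double> o = fft(odd);

    const std::size_t half = n / 2;
    const double pi = std::acos(-1.0);
    std::vector<double> out(2 * n, 0.0);
    for (std::size_t k = 0; k < half; ++k) {
        // Twiddle factor exp(-2*pi*i*k/N).
        const double angle = 2.0 * pi * double(k) / double(n);
        const double wr = std::cos(angle);
        const double wi = -std::sin(angle);

        // Complex product: twiddle * o[k].
        const double tr = wr * o[2 * k] - wi * o[2 * k + 1];
        const double ti = wr * o[2 * k + 1] + wi * o[2 * k];

        out[2 * k]              = e[2 * k] + tr;
        out[2 * k + 1]          = e[2 * k + 1] + ti;
        out[2 * (k + half)]     = e[2 * k] - tr;
        out[2 * (k + half) + 1] = e[2 * k + 1] - ti;
    }
    return out;
}
```

With this structure, a length-6 input is split once into two length-3 halves, and each half is computed by `dft` because 3 is odd. A power-of-two length never reaches the fallback.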