Leo Moll
8fcd1a3b32
main : provide option for creating JSON output ( #615 )
...
* examples : provide option for exporting also as JSON file (ggerganov/whisper.cpp#614 )
* main : remove leftovers
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-22 21:37:36 +02:00
Kamilake
992aa2cd1b
models : change default encoding to utf8 ( #605 )
2023-03-22 21:17:24 +02:00
Georgi Gerganov
4aa3bcf8a4
make : fix MUSL Linux build ( #576 )
2023-03-22 20:51:42 +02:00
Georgi Gerganov
1beff6f66d
models : change HF hosting from dataset to model
2023-03-22 20:44:56 +02:00
Takeshi Inoue
09e9068007
whisper.android : support benchmark for Android example. ( #542 )
...
* whisper.android: Support benchmark for Android example.
* whisper.android: update screenshot in README.
* update: Make text selectable for copy & paste.
* Update whisper.h to restore API name
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* whisper.android: Restore original API names.
---------
Co-authored-by: tinoue <tinoue@xevo.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-07 21:36:30 +02:00
Georgi Gerganov
fa9d43181f
readme : add bench-wts.sh demo
2023-03-06 21:06:27 +02:00
Georgi Gerganov
bb6b54a03d
bench-wts.sh : rename script + add execute permission
2023-03-06 21:02:24 +02:00
venkr
b597c5a779
qual-bench.sh : add quality comparison tool, and update main.cpp to allow using a font file ( #569 )
2023-03-06 19:18:11 +02:00
Takeshi Inoue
a3fb6c507f
whisper.android : enable fp16 instrinsics (FP16_VA) which is supported by ARMv8.2 or later. ( #572 )
2023-03-06 19:15:57 +02:00
sandrohanea
59fdcd19c8
whisper : add whisper_state + default state on the whisper_context ( #523 )
...
* Added whisper state + default state on the whisper_context
* Fixed some examples and bindings
* Fixed whisper_n_len (which was used in some binding) and added whisper_n_len_from_state
* Fixed comments
* whisper : reuse kv_cache_free() and fix compiler warnings
* whisper : clean-up the API comments
---------
Co-authored-by: Sandro Hanea <sandrohanea@microsoft.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-05 21:42:19 +02:00
Georgi Gerganov
478289a4b3
whisper : set no_context == true by default ( #537 )
2023-03-05 20:53:43 +02:00
polarmoon
5e94129cb2
go : NewContext now returns a clean context ( #537 )
...
Co-authored-by: Ming <ming@localhost>
2023-03-05 20:50:25 +02:00
HY. Kelvin Lee
72af0f5697
main : add csv header ( #552 )
2023-03-02 18:32:16 +02:00
Georgi Gerganov
af005d573f
make : add -DNDEBUG compile flag
2023-02-28 23:27:54 +02:00
Georgi Gerganov
ad1389003d
release : v1.2.1
2023-02-28 22:29:12 +02:00
FlippFuzz
f420de1322
make : add "-mcpu=native" when building for aarch64 ( #532 )
2023-02-27 21:04:16 +02:00
Aaron Pham
d176160f6f
readme : add pybind11 bindings ( #538 )
2023-02-27 21:02:11 +02:00
Georgi Gerganov
ca21f7ab16
readme : add cython bindings ( #9 )
2023-02-24 08:46:06 +02:00
Georgi Gerganov
373043cabe
whisper : zero-initialize some more context variables
...
Just in case
2023-02-21 19:00:42 +02:00
Finn Voorhees
fb4d0d470f
whisper : fix uninitialized exp_n_audio_ctx
2023-02-21 18:58:08 +02:00
Georgi Gerganov
0d229163bb
whisper : add API for applying custom logits filters during decoding
2023-02-19 18:35:01 +02:00
Georgi Gerganov
f254e78737
yt-wsp.sh : print help on empty args
2023-02-18 09:42:31 +02:00
Georgi Gerganov
a94897bcde
whisper : by default disable non-speech tokens suppression ( #473 )
...
This seems to be causing hallucinations in the end of the audio, e.g.:
"Thank you for listening"
"Amen"
..
2023-02-15 21:48:49 +02:00
Georgi Gerganov
2407ae8ef0
readme : add Ruby discussion + update .NET discussion
2023-02-15 19:51:54 +02:00
Todd
b623ca43b1
bindings : add Ruby ( #500 )
...
* adding ruby bindings
* avoid adding these they are copied in via extconf.rb
* ignore these files here
* add definitions for boolean params
* initial transcribe for ruby
* use en model and transcribe jfk with assertion
* possibly this works for building ruby binding
* ci : try to add ruby workflow
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-02-15 19:46:55 +02:00
conradg
69e6e4644a
main : fix std in input ( #503 )
...
if we don't add this as an explicit check, then we get an "error: unknown argument: -" later on
2023-02-15 19:31:16 +02:00
Georgi Gerganov
09d7d2b68e
examples : refactor in order to reuse code and reduce duplication ( #482 )
...
* examples : refactor common code into a library
* examples : refactor common SDL code into a library
* make : update Makefile to use common libs
* common : fix MSVC M_PI ..
* addon.node : link common lib
2023-02-15 19:28:10 +02:00
shikokuchuo
0336161b7d
whisper : fix signedness compiler warning ( #506 )
2023-02-15 19:08:25 +02:00
genevera (she/her)
459753342d
yt-wsp.sh : add unique filename generation ( #495 )
...
Co-authored-by: genevera <genevera@noreply.users.github.com>
2023-02-14 20:12:51 +02:00
Georgi Gerganov
9764782bd9
readme : add another .NET repo ( #303 )
2023-02-14 20:04:03 +02:00
Georgi Gerganov
3b010f9bed
readme : add .NET repo ( #303 )
2023-02-11 17:35:33 +02:00
Avik Sengupta
113fcec513
cmake : install whisper.h header ( #485 )
...
Including the header file in the install bundle helps projects that ship binaries.
2023-02-11 09:13:32 +02:00
shibukazu
cfc06bf8df
whisper : suppress non-speech-related token outputs ( #473 )
...
* add non-speech-token suppression
* add suppress non-speech_tokens param
2023-02-08 09:05:34 +02:00
sandrohanea
2bfe0ebc0f
whisper : fixed Beam Search Strategy and exposed whisper_pcm_to_mel_phase_vocoder ( #474 )
...
Co-authored-by: Sandro Hanea <sandrohanea@microsoft.com>
2023-02-08 09:01:47 +02:00
boolemancer
4dd7119deb
whisper : only trim if split_on_word is true ( #476 )
2023-02-08 08:43:23 +02:00
Qianhe Chen
ab1916fc59
ci : add node addon test and optimize compilation configuration ( #468 )
...
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
* ci: add addon.node test ci
* addon: remove build WHISPER_BUILD_TESTS
* addon: update build command
* addon: add test
* addon: add test file
* addon: adapt to compile on Windows
* addon: fix typo
* addon: reuse jfk.wav
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* addon: reuse jfk.wav
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-02-05 15:02:08 +02:00
kamranjon
a1c1583cc7
whisper : add whisper_full_lang_id() for getting the context lang ( #461 )
2023-02-05 14:46:26 +02:00
Matija Pevec
d012b5c7e4
whisper : add "split_on_word" flag when using using "max_len" option ( #455 )
...
* Update whisper.cpp
* fix: trim function
* feat: added flag to split on word
* fix: arguments for main
2023-02-05 14:44:23 +02:00
Georgi Gerganov
b2083c5d02
release : v1.2.0
2023-02-04 09:49:49 +02:00
Georgi Gerganov
f3ee4a9673
whisper : reduce memory usage during inference ( #431 )
...
* ggml : add "scratch" buffer support
* ggml : support for scratch ring-buffer
* ggml : bug fix in ggml_repeat()
* ggml : error on scratch buffer overflow
* whisper : use scratch buffers during inference (base model only)
* whisper : update memory usage for all models
* whisper : fix encoder memory usage
* whisper : use whisper_context functions instead of macros
* whisper : fix FF + remove it from README
* ggml : reuse ggml_new_i32
* ggml : refactor the scratch buffer storage
* whisper : reorder scratch buffers in the decoder
* main : add option to disable temp fallback
* Update README.md
2023-02-04 09:45:52 +02:00
Qianhe Chen
c306a7fd89
addon.node : using whisper as a Node.js addon ( #443 )
...
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
2023-02-04 09:10:25 +02:00
polarmoon
b2fc4c7010
go : support "auto" as an option when set language ( #462 )
...
Co-authored-by: Ming <ming@localhost>
2023-02-04 09:09:27 +02:00
Georgi Gerganov
291980369c
whisper : suppress task tokens ( #442 )
2023-02-04 09:03:14 +02:00
Taisei Mima
86ef64a855
wasm : fix typo in helper.js ( #459 )
2023-02-04 08:49:15 +02:00
Alex Bacart
3b1960520a
main : CSV format export trimmed spaces fix ( #444 )
...
* Update main.cpp
Removed string trimming
* Update main.cpp
* Update main.cpp
* Revert "Update main.cpp"
This reverts commit d8924fdcfe
.
* Revert "Update main.cpp"
This reverts commit 252e508d85
.
2023-02-04 08:48:35 +02:00
Lukas Rist
2bee2650c6
go : add wrapper for system info ( #456 )
2023-01-28 18:44:56 +02:00
Robin
beb9512be3
go : add WhisperLangAutoDetect method to go binding ( #451 )
2023-01-27 01:14:20 +02:00
Eric Tendian
47737b2e82
livestream.sh : run main with model arg instead of default ( #453 )
...
Actually utilizes the $model var when calling ./main.
2023-01-27 01:13:31 +02:00
Georgi Gerganov
b992f3709e
whisper : do not provide past prompt when n_max_text_ctx == 0
2023-01-25 20:01:00 +02:00
Georgi Gerganov
60337f5306
wasm : check if navigator.storage.estimate() is available
...
Safari does not support it
2023-01-25 20:00:59 +02:00