f420de1322
make : add "-mcpu=native" when building for aarch64 ( #532 )
2023-02-27 21:04:16 +02:00
d176160f6f
readme : add pybind11 bindings ( #538 )
2023-02-27 21:02:11 +02:00
ca21f7ab16
readme : add cython bindings ( #9 )
2023-02-24 08:46:06 +02:00
373043cabe
whisper : zero-initialize some more context variables
...
Just in case
2023-02-21 19:00:42 +02:00
fb4d0d470f
whisper : fix uninitialized exp_n_audio_ctx
2023-02-21 18:58:08 +02:00
0d229163bb
whisper : add API for applying custom logits filters during decoding
2023-02-19 18:35:01 +02:00
f254e78737
yt-wsp.sh : print help on empty args
2023-02-18 09:42:31 +02:00
a94897bcde
whisper : by default disable non-speech tokens suppression ( #473 )
...
This seems to be causing hallucinations in the end of the audio, e.g.:
"Thank you for listening"
"Amen"
..
2023-02-15 21:48:49 +02:00
2407ae8ef0
readme : add Ruby discussion + update .NET discussion
2023-02-15 19:51:54 +02:00
b623ca43b1
bindings : add Ruby ( #500 )
...
* adding ruby bindings
* avoid adding these they are copied in via extconf.rb
* ignore these files here
* add definitions for boolean params
* initial transcribe for ruby
* use en model and transcribe jfk with assertion
* possibly this works for building ruby binding
* ci : try to add ruby workflow
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com >
2023-02-15 19:46:55 +02:00
69e6e4644a
main : fix std in input ( #503 )
...
if we don't add this as an explicit check, then we get an "error: unknown argument: -" later on
2023-02-15 19:31:16 +02:00
09d7d2b68e
examples : refactor in order to reuse code and reduce duplication ( #482 )
...
* examples : refactor common code into a library
* examples : refactor common SDL code into a library
* make : update Makefile to use common libs
* common : fix MSVC M_PI ..
* addon.node : link common lib
2023-02-15 19:28:10 +02:00
0336161b7d
whisper : fix signedness compiler warning ( #506 )
2023-02-15 19:08:25 +02:00
459753342d
yt-wsp.sh : add unique filename generation ( #495 )
...
Co-authored-by: genevera <genevera@noreply.users.github.com >
2023-02-14 20:12:51 +02:00
9764782bd9
readme : add another .NET repo ( #303 )
2023-02-14 20:04:03 +02:00
3b010f9bed
readme : add .NET repo ( #303 )
2023-02-11 17:35:33 +02:00
113fcec513
cmake : install whisper.h header ( #485 )
...
Including the header file in the install bundle helps projects that ship binaries.
2023-02-11 09:13:32 +02:00
cfc06bf8df
whisper : suppress non-speech-related token outputs ( #473 )
...
* add non-speech-token suppression
* add suppress non-speech_tokens param
2023-02-08 09:05:34 +02:00
2bfe0ebc0f
whisper : fixed Beam Search Strategy and exposed whisper_pcm_to_mel_phase_vocoder ( #474 )
...
Co-authored-by: Sandro Hanea <sandrohanea@microsoft.com >
2023-02-08 09:01:47 +02:00
4dd7119deb
whisper : only trim if split_on_word is true ( #476 )
2023-02-08 08:43:23 +02:00
ab1916fc59
ci : add node addon test and optimize compilation configuration ( #468 )
...
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
* ci: add addon.node test ci
* addon: remove build WHISPER_BUILD_TESTS
* addon: update build command
* addon: add test
* addon: add test file
* addon: adapt to compile on Windows
* addon: fix typo
* addon: reuse jfk.wav
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com >
* addon: reuse jfk.wav
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com >
2023-02-05 15:02:08 +02:00
a1c1583cc7
whisper : add whisper_full_lang_id() for getting the context lang ( #461 )
2023-02-05 14:46:26 +02:00
d012b5c7e4
whisper : add "split_on_word" flag when using using "max_len" option ( #455 )
...
* Update whisper.cpp
* fix: trim function
* feat: added flag to split on word
* fix: arguments for main
2023-02-05 14:44:23 +02:00
b2083c5d02
release : v1.2.0
v1.2.0
2023-02-04 09:49:49 +02:00
f3ee4a9673
whisper : reduce memory usage during inference ( #431 )
...
* ggml : add "scratch" buffer support
* ggml : support for scratch ring-buffer
* ggml : bug fix in ggml_repeat()
* ggml : error on scratch buffer overflow
* whisper : use scratch buffers during inference (base model only)
* whisper : update memory usage for all models
* whisper : fix encoder memory usage
* whisper : use whisper_context functions instead of macros
* whisper : fix FF + remove it from README
* ggml : reuse ggml_new_i32
* ggml : refactor the scratch buffer storage
* whisper : reorder scratch buffers in the decoder
* main : add option to disable temp fallback
* Update README.md
2023-02-04 09:45:52 +02:00
c306a7fd89
addon.node : using whisper as a Node.js addon ( #443 )
...
* addon: implement node addon call whisper through cpp
* addon: modify the license to MIT
* addon: remove iostream
* addon: rename dir
* addon: fix typo
* addon: configure cmake to build when cmake-js is used
2023-02-04 09:10:25 +02:00
b2fc4c7010
go : support "auto" as an option when set language ( #462 )
...
Co-authored-by: Ming <ming@localhost>
2023-02-04 09:09:27 +02:00
291980369c
whisper : suppress task tokens ( #442 )
2023-02-04 09:03:14 +02:00
86ef64a855
wasm : fix typo in helper.js ( #459 )
2023-02-04 08:49:15 +02:00
3b1960520a
main : CSV format export trimmed spaces fix ( #444 )
...
* Update main.cpp
Removed string trimming
* Update main.cpp
* Update main.cpp
* Revert "Update main.cpp"
This reverts commit d8924fdcfe
.
* Revert "Update main.cpp"
This reverts commit 252e508d85
.
2023-02-04 08:48:35 +02:00
2bee2650c6
go : add wrapper for system info ( #456 )
2023-01-28 18:44:56 +02:00
beb9512be3
go : add WhisperLangAutoDetect method to go binding ( #451 )
2023-01-27 01:14:20 +02:00
47737b2e82
livestream.sh : run main with model arg instead of default ( #453 )
...
Actually utilizes the $model var when calling ./main.
2023-01-27 01:13:31 +02:00
b992f3709e
whisper : do not provide past prompt when n_max_text_ctx == 0
2023-01-25 20:01:00 +02:00
60337f5306
wasm : check if navigator.storage.estimate() is available
...
Safari does not support it
2023-01-25 20:00:59 +02:00
02c7516c57
go : added wrappers to reset and print timings ( #436 )
2023-01-25 18:57:30 +02:00
411ea9b833
ci : run workflows on pull requests + bindings depend on .h ( #446 )
2023-01-25 18:50:50 +02:00
11f61cecd6
whisper.wasm : add labels for easier radio selection ( #435 )
2023-01-23 20:49:00 +02:00
b5ddb16ec7
whisper : condition timestamps to be monotonically increasing ( #425 )
2023-01-23 20:48:26 +02:00
ae16c21e9c
whisper : PPC64 big-endian support ( #398 )
...
* ggml : set cache line size to 128 on POWER9
* whisper : add PPC64 big endian support
2023-01-23 20:48:10 +02:00
2c3f50a021
release : v1.1.1
v1.1.1
2023-01-23 20:23:44 +02:00
9a65269a20
.gitignore : add arm_neon.h
2023-01-23 20:19:04 +02:00
78f166174f
whisper : fix condition for providing past prompt (critical)
...
This bug has been present since v1.1.0.
Effectively, the past transcribed text wasn't being used for following
transcriptions, which likely significantly reduces the transcription
quality.
Likely related to #419
2023-01-22 10:47:01 +02:00
21c569ba4a
whisper : extend information in whisper_print_timings()
2023-01-19 18:50:33 +02:00
1a91c19af9
whisper : perform entropy check only when we have at least 32 tokens ( #412 )
2023-01-18 22:52:18 +02:00
f583e2d2f5
main : we had accidentally disabled the temperature fallback .. ( #291 )
2023-01-18 22:51:41 +02:00
206fc93396
whisper.wasm : add small and small.en models
2023-01-18 21:58:55 +02:00
a6cf6f4c4a
bench : minor fixes
2023-01-18 21:40:10 +02:00
472a473fd1
main : add an option to accept optional output filenames ( #424 )
...
* Add an option to accept optional output filenames
* Format the file
Co-authored-by: Chia-Hsiang Cheng <gary.chiahsiang.cheng@gmail.com >
2023-01-18 21:26:31 +02:00
9ba66c2fad
stream : fix handling of --step == --length ( #416 )
2023-01-18 21:22:52 +02:00