Georgi Gerganov
|
794b162a46
|
whisper : add integer quantization support (#540)
* whisper : add integer quantization support
* examples : add common-ggml + prepare to add "quantize" tool
* whisper : quantization tool ready
* whisper : fix F32 support
* whisper : try to fix shared lib linkage
* wasm : update quantized models to Q5
* bench.wasm : remove "medium" button
* bench.wasm : fix custom model button
* ggml : add Q5_0 and Q5_1 WASM SIMD
* wasm : add quantized models to all WASM examples
* wasm : bump DB version number to 2
* talk-llama : update example to latest llama.cpp
* node : increase test timeout to 10s
* readme : add information for model quantization
* wasm : add links to other examples
|
2023-04-30 18:51:57 +03:00 |
|
Georgi Gerganov
|
87dd4a3081
|
talk.wasm : bump memory usage + update whisper.js
|
2023-01-06 21:13:44 +02:00 |
|
Andy Maloney
|
84c6b42e65
|
cmake : update to 3.19 (#351)
- update from 3.0 (from 2014) to 3.19 (from 2020)
- move some global setting onto the targets (through a cmake include)
|
2023-01-05 21:22:48 +02:00 |
|
Georgi Gerganov
|
e4805d9601
|
wasm : refactor wasm example + reuse fetch mechanism
|
2022-11-24 23:13:26 +02:00 |
|
Georgi Gerganov
|
37422ed733
|
talk.wasm : add audio pre-processing + bump memory
|
2022-11-24 00:34:00 +02:00 |
|
Georgi Gerganov
|
be3b720f96
|
talk.wasm : refactoring + update README.md
|
2022-11-24 00:08:57 +02:00 |
|
Georgi Gerganov
|
a4dfbeecf9
|
talk.wasm : GPT-2 meets Whisper in WebAssembly (#155)
* talk : initial real-time transcription in the browser
* talk : polishing the UI
* talk : ready for beta testing
* talk.wasm : rename example
|
2022-11-21 22:20:42 +02:00 |
|