mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-06-22 00:13:35 +00:00

talk

Talk with an Artificial Intelligence in your terminal

Web version: examples/talk.wasm

Building

The talk tool depends on the SDL2 library to capture audio from the microphone. You can build it like this:

# Install SDL2 on Linux
sudo apt-get install libsdl2-dev

# Install SDL2 on macOS
brew install sdl2

# Build the "talk" executable
make talk

# Run it
./talk -p Santa

GPT-2

To run this, you will need a ggml GPT-2 model: instructions

Alternatively, you can simply download the smallest ggml GPT-2 117M model (240 MB) like this:

wget --quiet --show-progress -O models/ggml-gpt-2-117M.bin https://huggingface.co/ggerganov/ggml/raw/main/ggml-model-gpt-2-117M.bin
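
Because wget runs with --quiet here, a failed or truncated download can go unnoticed. The following is a minimal sketch of a size check, not part of the repository: the helper name model_looks_complete and the 100 MB threshold are assumptions chosen for illustration, based only on the roughly 240 MB size stated above.

```shell
#!/bin/sh
# Hypothetical helper (not part of whisper.cpp): succeed only if the file
# exists and is at least min_bytes long.
model_looks_complete() {
    model_path="$1"
    min_bytes="$2"
    [ -f "$model_path" ] || return 1
    # Arithmetic expansion strips the padding some wc implementations emit.
    actual_bytes=$(($(wc -c < "$model_path")))
    [ "$actual_bytes" -ge "$min_bytes" ]
}

# Assumed threshold: 100 MB, well below the expected ~240 MB.
if model_looks_complete models/ggml-gpt-2-117M.bin 100000000; then
    echo "model looks complete"
else
    echo "model file is missing or too small; try downloading it again" >&2
fi
```

If the check reports a file that is too small, inspecting its first few lines with head can show whether an error page or placeholder was saved instead of the model.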

TTS

For the best experience, this example needs a TTS tool to convert the generated text responses to voice. You can use any TTS engine you like: simply edit the speak.sh script to suit your needs. By default, it is configured to use espeak.
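
As an illustration of the kind of edit you might make, here is a minimal sketch of an alternative speak.sh that uses whichever engine is installed. It is not the script shipped in the repository. It assumes the script is called with a voice id and the text to speak as its two arguments, and that the voice id is a number that maps to an espeak male voice variant. The function names pick_tts_engine and speak are hypothetical.

```shell
#!/bin/sh
# Hypothetical variant of speak.sh.
# Assumed usage: speak.sh <voice_id> <text-to-speak>

# Print the first candidate command found on PATH; fail if none is found.
pick_tts_engine() {
    for candidate in "$@"; do
        if command -v "$candidate" >/dev/null 2>&1; then
            echo "$candidate"
            return 0
        fi
    done
    return 1
}

speak() {
    voice_id="$1"
    text="$2"
    # Fall back to printing the text when no TTS engine is installed.
    engine=$(pick_tts_engine espeak say) || { echo "$text"; return 0; }
    case "$engine" in
        # Assumption: voice_id selects an espeak variant such as en-us+m3.
        espeak) espeak -v "en-us+m${voice_id}" "$text" ;;
        # macOS built-in TTS; the voice id is ignored in this sketch.
        say)    say "$text" ;;
    esac
}

# Only speak when both arguments are supplied.
if [ "$#" -ge 2 ]; then
    speak "$1" "$2"
fi
```

Listing espeak first keeps the default behavior described above. Reordering the candidates, or adding another engine with its own case branch, is the only change needed to switch engines.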