mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-06-19 07:18:07 +00:00

Files

Przemysław Pawełczyk 3f7a03ebe3 ggml : do not use _GNU_SOURCE gratuitously (#1027 )

* Do not use _GNU_SOURCE gratuitously.

What is needed to build whisper.cpp and examples is availability of
stuff defined in The Open Group Base Specifications Issue 6
(https://pubs.opengroup.org/onlinepubs/009695399/) known also as
Single Unix Specification v3 (SUSv3) or POSIX.1-2001 + XSI extensions.

There is no need to penalize musl libc which simply follows standards.

Not having feature test macros in source code gives greater flexibility
to those wanting to reuse it in 3rd party app, as they can build it with
minimal FTM (_XOPEN_SOURCE=600) or other FTM depending on their needs.

It builds without issues in Alpine (musl libc), Ubuntu (glibc), MSYS2.

* examples : include SDL headers before other headers

This is an attempt at fixing macOS build error coming from SDL2 relying
on Darwin extension memset_pattern4/8/16 coming from Apple's string.h.

2023-06-25 16:34:30 +03:00

.gitignore

talk, talk-llama : add basic example script for eleven-labs tts (#728 )

2023-04-14 19:53:58 +03:00

CMakeLists.txt

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

eleven-labs.py

examples : update elevenlabs scripts to use official python API (#837 )

2023-05-24 21:11:01 +03:00

gpt-2.cpp

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

gpt-2.h

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

README.md

speak scripts for Windows

2023-06-01 22:45:00 +10:00

speak

speak scripts for Windows

2023-06-01 22:45:00 +10:00

speak.bat

speak scripts for Windows

2023-06-01 22:45:00 +10:00

speak.ps1

speak scripts for Windows

2023-06-01 22:45:00 +10:00

talk.cpp

ggml : do not use _GNU_SOURCE gratuitously (#1027 )

2023-06-25 16:34:30 +03:00

README.md

talk

Talk with an Artificial Intelligence in your terminal

Demo Talk

Web version: examples/talk.wasm

Building

The talk tool depends on SDL2 library to capture audio from the microphone. You can build it like this:

# Install SDL2 on Linux
sudo apt-get install libsdl2-dev

# Install SDL2 on Mac OS
brew install sdl2

# Build the "talk" executable
make talk

# Run it
./talk -p Santa

GPT-2

To run this, you will need a ggml GPT-2 model: instructions

Alternatively, you can simply download the smallest ggml GPT-2 117M model (240 MB) like this:

wget --quiet --show-progress -O models/ggml-gpt-2-117M.bin https://huggingface.co/ggerganov/ggml/raw/main/ggml-model-gpt-2-117M.bin

TTS

For best experience, this example needs a TTS tool to convert the generated text responses to voice. You can use any TTS engine that you would like - simply edit the speak script to your needs. By default, it is configured to use MacOS's say or espeak or Windows SpeechSynthesizer, but you can use whatever you wish.