whisper.cpp/examples/talk-llama/speak

#!/bin/bash

# Usage:
#  speak <voice_id> <textfile>

function installed() { command -v $1 >/dev/null 2>&1; }

if installed espeak; then
  espeak -v en-us+m$1 -s 225 -p 50 -a 200 -g 5 -k 5 -f $2

elif installed piper && installed aplay; then
  cat $2 | piper --model ~/en_US-lessac-medium.onnx --output-raw | aplay -q -r 22050 -f S16_LE -t raw -

# for Mac
elif installed say; then
  say -f $2

# Eleven Labs
elif installed python3 && \
  python3 -c 'import importlib.util; exit(not importlib.util.find_spec("elevenlabs"))' && \
  installed ffplay; then
    # It's possible to use the API for free with limited number of characters.
    # To increase this limit register to https://beta.elevenlabs.io to get an api key
    # and paste it after 'ELEVEN_API_KEY='
    # Keep the line commented to use the free version without api key
    #export ELEVEN_API_KEY=your_api_key
    wd=$(dirname $0)
    script=$wd/eleven-labs.py
    python3 $script -q -p -v $1 $2 >/dev/null 2>&1

    # Uncomment to keep the audio file
    #python3 $script -q -s ./audio.mp3 -v $1 $2 >/dev/null 2>&1
    #ffplay -autoexit -nodisp -loglevel quiet -hide_banner -i ./audio.mp3 >/dev/null 2>&1

else
  echo 'Install espeak ("brew install espeak" or "apt-get install espeak"),'
  echo 'piper ("pip install piper-tts" or https://github.com/rhasspy/piper) with aplay,'
  echo 'or elevenlabs ("pip install elevenlabs") with ffplay.'
  echo '(export ELEVEN_API_KEY if you have an api key from https://beta.elevenlabs.io)'
fi
talk-llama : add new example + sync ggml from llama.cpp (#664) * talk-llama : talk with LLaMA AI * talk.llama : disable EOS token * talk-llama : add README instructions * ggml : fix build in debug 2023-03-27 21:00:32 +03:00			`#!/bin/bash`

			`# Usage:`
talk, talk-llama : pass text_to_speak as a file (#1865) * talk-llama: pass file instead of arg it is too hard to quote text in a portable way * talk-llama: pass heard_ok as a file * talk-llama: let eleven-labs.py accept options Options: -v voice, -s savefile, -p (--play) * talk-llama: check installed commands in "speak" Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed * talk-llama: pass voice_id again in order to sync talk with talk-llama * talk: sync with talk-llama Passing text_to_speak as a file is safer and more portable cf. https://stackoverflow.com/a/59036879/45375 * talk and talk-llama: get all installed voices in speak.ps1 * talk and talk-llama: get voices from api * talk and talk-llama: add more options to eleven-labs.py and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/) ``` usage: eleven-labs.py [-q] [-l] [-h] [-n NAME \| -v NUMBER] [-f KEY=VAL] [-s FILE \| -p] [TEXTFILE] options: -q, --quick skip checking the required library action: TEXTFILE read the text file (default: stdin) -l, --list show the list of voices and exit -h, --help show this help and exit voice selection: -n NAME, --name NAME get a voice object by name (default: Arnold) -v NUMBER, --voice NUMBER get a voice object by number (see --list) -f KEY=VAL, --filter KEY=VAL filter voices by labels (default: "use case=narration") this option can be used multiple times filtering will be disabled if the first -f has no "=" (e.g. -f "any") output: -s FILE, --save FILE save the TTS to a file (default: audio.mp3) -p, --play play the TTS with ffplay ``` * examples: add speak_with_file() as suggested in the review * talk and talk-llama: ignore to_speak.txt 2024-02-24 16:24:47 +09:00			`# speak <voice_id> <textfile>`

			`function installed() { command -v $1 >/dev/null 2>&1; }`

			`if installed espeak; then`
			`espeak -v en-us+m$1 -s 225 -p 50 -a 200 -g 5 -k 5 -f $2`

			`elif installed piper && installed aplay; then`
			`cat $2 \| piper --model ~/en_US-lessac-medium.onnx --output-raw \| aplay -q -r 22050 -f S16_LE -t raw -`
talk-llama : add optional Piper TTS support (#1749) Add commented-out command to optionally use Piper (https://github.com/rhasspy/piper) as text-to-speech solution for the talk-llama example. Piper voices sound almost like real people which is a big improvement (e.g.) from something like espeak. 2024-01-10 15:15:28 +01:00
talk-llama : add new example + sync ggml from llama.cpp (#664) * talk-llama : talk with LLaMA AI * talk.llama : disable EOS token * talk-llama : add README instructions * ggml : fix build in debug 2023-03-27 21:00:32 +03:00			`# for Mac`
talk, talk-llama : pass text_to_speak as a file (#1865) * talk-llama: pass file instead of arg it is too hard to quote text in a portable way * talk-llama: pass heard_ok as a file * talk-llama: let eleven-labs.py accept options Options: -v voice, -s savefile, -p (--play) * talk-llama: check installed commands in "speak" Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed * talk-llama: pass voice_id again in order to sync talk with talk-llama * talk: sync with talk-llama Passing text_to_speak as a file is safer and more portable cf. https://stackoverflow.com/a/59036879/45375 * talk and talk-llama: get all installed voices in speak.ps1 * talk and talk-llama: get voices from api * talk and talk-llama: add more options to eleven-labs.py and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/) ``` usage: eleven-labs.py [-q] [-l] [-h] [-n NAME \| -v NUMBER] [-f KEY=VAL] [-s FILE \| -p] [TEXTFILE] options: -q, --quick skip checking the required library action: TEXTFILE read the text file (default: stdin) -l, --list show the list of voices and exit -h, --help show this help and exit voice selection: -n NAME, --name NAME get a voice object by name (default: Arnold) -v NUMBER, --voice NUMBER get a voice object by number (see --list) -f KEY=VAL, --filter KEY=VAL filter voices by labels (default: "use case=narration") this option can be used multiple times filtering will be disabled if the first -f has no "=" (e.g. -f "any") output: -s FILE, --save FILE save the TTS to a file (default: audio.mp3) -p, --play play the TTS with ffplay ``` * examples: add speak_with_file() as suggested in the review * talk and talk-llama: ignore to_speak.txt 2024-02-24 16:24:47 +09:00			`elif installed say; then`
			`say -f $2`
talk-llama : add new example + sync ggml from llama.cpp (#664) * talk-llama : talk with LLaMA AI * talk.llama : disable EOS token * talk-llama : add README instructions * ggml : fix build in debug 2023-03-27 21:00:32 +03:00
			`# Eleven Labs`
talk, talk-llama : pass text_to_speak as a file (#1865) * talk-llama: pass file instead of arg it is too hard to quote text in a portable way * talk-llama: pass heard_ok as a file * talk-llama: let eleven-labs.py accept options Options: -v voice, -s savefile, -p (--play) * talk-llama: check installed commands in "speak" Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed * talk-llama: pass voice_id again in order to sync talk with talk-llama * talk: sync with talk-llama Passing text_to_speak as a file is safer and more portable cf. https://stackoverflow.com/a/59036879/45375 * talk and talk-llama: get all installed voices in speak.ps1 * talk and talk-llama: get voices from api * talk and talk-llama: add more options to eleven-labs.py and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/) ``` usage: eleven-labs.py [-q] [-l] [-h] [-n NAME \| -v NUMBER] [-f KEY=VAL] [-s FILE \| -p] [TEXTFILE] options: -q, --quick skip checking the required library action: TEXTFILE read the text file (default: stdin) -l, --list show the list of voices and exit -h, --help show this help and exit voice selection: -n NAME, --name NAME get a voice object by name (default: Arnold) -v NUMBER, --voice NUMBER get a voice object by number (see --list) -f KEY=VAL, --filter KEY=VAL filter voices by labels (default: "use case=narration") this option can be used multiple times filtering will be disabled if the first -f has no "=" (e.g. -f "any") output: -s FILE, --save FILE save the TTS to a file (default: audio.mp3) -p, --play play the TTS with ffplay ``` * examples: add speak_with_file() as suggested in the review * talk and talk-llama: ignore to_speak.txt 2024-02-24 16:24:47 +09:00			`elif installed python3 && \`
			`python3 -c 'import importlib.util; exit(not importlib.util.find_spec("elevenlabs"))' && \`
			`installed ffplay; then`
			`# It's possible to use the API for free with limited number of characters.`
			`# To increase this limit register to https://beta.elevenlabs.io to get an api key`
			`# and paste it after 'ELEVEN_API_KEY='`
			`# Keep the line commented to use the free version without api key`
			`#export ELEVEN_API_KEY=your_api_key`
			`wd=$(dirname $0)`
			`script=$wd/eleven-labs.py`
			`python3 $script -q -p -v $1 $2 >/dev/null 2>&1`

			`# Uncomment to keep the audio file`
			`#python3 $script -q -s ./audio.mp3 -v $1 $2 >/dev/null 2>&1`
			`#ffplay -autoexit -nodisp -loglevel quiet -hide_banner -i ./audio.mp3 >/dev/null 2>&1`

			`else`
			`echo 'Install espeak ("brew install espeak" or "apt-get install espeak"),'`
			`echo 'piper ("pip install piper-tts" or https://github.com/rhasspy/piper) with aplay,'`
			`echo 'or elevenlabs ("pip install elevenlabs") with ffplay.'`
			`echo '(export ELEVEN_API_KEY if you have an api key from https://beta.elevenlabs.io)'`
			`fi`