2023-04-14 18:53:58 +02:00
|
|
|
import sys
|
talk, talk-llama : pass text_to_speak as a file (#1865)
* talk-llama: pass file instead of arg
it is too hard to quote text in a portable way
* talk-llama: pass heard_ok as a file
* talk-llama: let eleven-labs.py accept options
Options: -v voice, -s savefile, -p (--play)
* talk-llama: check installed commands in "speak"
Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed
* talk-llama: pass voice_id again
in order to sync talk with talk-llama
* talk: sync with talk-llama
Passing text_to_speak as a file is safer and more portable
cf. https://stackoverflow.com/a/59036879/45375
* talk and talk-llama: get all installed voices in speak.ps1
* talk and talk-llama: get voices from api
* talk and talk-llama: add more options to eleven-labs.py
and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/)
```
usage: eleven-labs.py [-q] [-l] [-h] [-n NAME | -v NUMBER] [-f KEY=VAL] [-s FILE | -p] [TEXTFILE]
options:
-q, --quick skip checking the required library
action:
TEXTFILE read the text file (default: stdin)
-l, --list show the list of voices and exit
-h, --help show this help and exit
voice selection:
-n NAME, --name NAME get a voice object by name (default: Arnold)
-v NUMBER, --voice NUMBER
get a voice object by number (see --list)
-f KEY=VAL, --filter KEY=VAL
filter voices by labels (default: "use case=narration")
this option can be used multiple times
filtering will be disabled if the first -f has no "=" (e.g. -f "any")
output:
-s FILE, --save FILE save the TTS to a file (default: audio.mp3)
-p, --play play the TTS with ffplay
```
* examples: add speak_with_file()
as suggested in the review
* talk and talk-llama: ignore to_speak.txt
2024-02-24 16:24:47 +09:00
|
|
|
import argparse
|
|
|
|
import textwrap
|
2023-04-14 18:53:58 +02:00
|
|
|
|
talk, talk-llama : pass text_to_speak as a file (#1865)
* talk-llama: pass file instead of arg
it is too hard to quote text in a portable way
* talk-llama: pass heard_ok as a file
* talk-llama: let eleven-labs.py accept options
Options: -v voice, -s savefile, -p (--play)
* talk-llama: check installed commands in "speak"
Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed
* talk-llama: pass voice_id again
in order to sync talk with talk-llama
* talk: sync with talk-llama
Passing text_to_speak as a file is safer and more portable
cf. https://stackoverflow.com/a/59036879/45375
* talk and talk-llama: get all installed voices in speak.ps1
* talk and talk-llama: get voices from api
* talk and talk-llama: add more options to eleven-labs.py
and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/)
```
usage: eleven-labs.py [-q] [-l] [-h] [-n NAME | -v NUMBER] [-f KEY=VAL] [-s FILE | -p] [TEXTFILE]
options:
-q, --quick skip checking the required library
action:
TEXTFILE read the text file (default: stdin)
-l, --list show the list of voices and exit
-h, --help show this help and exit
voice selection:
-n NAME, --name NAME get a voice object by name (default: Arnold)
-v NUMBER, --voice NUMBER
get a voice object by number (see --list)
-f KEY=VAL, --filter KEY=VAL
filter voices by labels (default: "use case=narration")
this option can be used multiple times
filtering will be disabled if the first -f has no "=" (e.g. -f "any")
output:
-s FILE, --save FILE save the TTS to a file (default: audio.mp3)
-p, --play play the TTS with ffplay
```
* examples: add speak_with_file()
as suggested in the review
* talk and talk-llama: ignore to_speak.txt
2024-02-24 16:24:47 +09:00
|
|
|
parser = argparse.ArgumentParser(add_help=False,
|
|
|
|
formatter_class=argparse.RawTextHelpFormatter)
|
|
|
|
parser.add_argument("-q", "--quick", action="store_true",
|
|
|
|
help="skip checking the required library")
|
|
|
|
|
|
|
|
modes = parser.add_argument_group("action")
|
|
|
|
modes.add_argument("inputfile", metavar="TEXTFILE",
|
|
|
|
nargs='?', type=argparse.FileType(), default=sys.stdin,
|
|
|
|
help="read the text file (default: stdin)")
|
|
|
|
modes.add_argument("-l", "--list", action="store_true",
|
|
|
|
help="show the list of voices and exit")
|
|
|
|
modes.add_argument("-h", "--help", action="help",
|
|
|
|
help="show this help and exit")
|
|
|
|
|
|
|
|
selopts = parser.add_argument_group("voice selection")
|
|
|
|
selmodes = selopts.add_mutually_exclusive_group()
|
|
|
|
selmodes.add_argument("-n", "--name",
|
|
|
|
default="Arnold",
|
|
|
|
help="get a voice object by name (default: Arnold)")
|
|
|
|
selmodes.add_argument("-v", "--voice", type=int, metavar="NUMBER",
|
|
|
|
help="get a voice object by number (see --list)")
|
|
|
|
selopts.add_argument("-f", "--filter", action="append", metavar="KEY=VAL",
|
|
|
|
default=["use case=narration"],
|
|
|
|
help=textwrap.dedent('''\
|
|
|
|
filter voices by labels (default: "use case=narration")
|
|
|
|
this option can be used multiple times
|
|
|
|
filtering will be disabled if the first -f has no "=" (e.g. -f "any")
|
|
|
|
'''))
|
|
|
|
|
|
|
|
outmodes = parser.add_argument_group("output")
|
|
|
|
outgroup = outmodes.add_mutually_exclusive_group()
|
|
|
|
outgroup.add_argument("-s", "--save", metavar="FILE",
|
|
|
|
default="audio.mp3",
|
|
|
|
help="save the TTS to a file (default: audio.mp3)")
|
|
|
|
outgroup.add_argument("-p", "--play", action="store_true",
|
|
|
|
help="play the TTS with ffplay")
|
|
|
|
|
|
|
|
args = parser.parse_args()
|
2023-04-14 18:53:58 +02:00
|
|
|
|
talk, talk-llama : pass text_to_speak as a file (#1865)
* talk-llama: pass file instead of arg
it is too hard to quote text in a portable way
* talk-llama: pass heard_ok as a file
* talk-llama: let eleven-labs.py accept options
Options: -v voice, -s savefile, -p (--play)
* talk-llama: check installed commands in "speak"
Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed
* talk-llama: pass voice_id again
in order to sync talk with talk-llama
* talk: sync with talk-llama
Passing text_to_speak as a file is safer and more portable
cf. https://stackoverflow.com/a/59036879/45375
* talk and talk-llama: get all installed voices in speak.ps1
* talk and talk-llama: get voices from api
* talk and talk-llama: add more options to eleven-labs.py
and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/)
```
usage: eleven-labs.py [-q] [-l] [-h] [-n NAME | -v NUMBER] [-f KEY=VAL] [-s FILE | -p] [TEXTFILE]
options:
-q, --quick skip checking the required library
action:
TEXTFILE read the text file (default: stdin)
-l, --list show the list of voices and exit
-h, --help show this help and exit
voice selection:
-n NAME, --name NAME get a voice object by name (default: Arnold)
-v NUMBER, --voice NUMBER
get a voice object by number (see --list)
-f KEY=VAL, --filter KEY=VAL
filter voices by labels (default: "use case=narration")
this option can be used multiple times
filtering will be disabled if the first -f has no "=" (e.g. -f "any")
output:
-s FILE, --save FILE save the TTS to a file (default: audio.mp3)
-p, --play play the TTS with ffplay
```
* examples: add speak_with_file()
as suggested in the review
* talk and talk-llama: ignore to_speak.txt
2024-02-24 16:24:47 +09:00
|
|
|
if not args.quick:
|
|
|
|
import importlib.util
|
|
|
|
if importlib.util.find_spec("elevenlabs") is None:
|
|
|
|
print("elevenlabs library is not installed, you can install it to your enviroment using 'pip install elevenlabs'")
|
|
|
|
sys.exit()
|
2023-04-14 18:53:58 +02:00
|
|
|
|
talk, talk-llama : pass text_to_speak as a file (#1865)
* talk-llama: pass file instead of arg
it is too hard to quote text in a portable way
* talk-llama: pass heard_ok as a file
* talk-llama: let eleven-labs.py accept options
Options: -v voice, -s savefile, -p (--play)
* talk-llama: check installed commands in "speak"
Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed
* talk-llama: pass voice_id again
in order to sync talk with talk-llama
* talk: sync with talk-llama
Passing text_to_speak as a file is safer and more portable
cf. https://stackoverflow.com/a/59036879/45375
* talk and talk-llama: get all installed voices in speak.ps1
* talk and talk-llama: get voices from api
* talk and talk-llama: add more options to eleven-labs.py
and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/)
```
usage: eleven-labs.py [-q] [-l] [-h] [-n NAME | -v NUMBER] [-f KEY=VAL] [-s FILE | -p] [TEXTFILE]
options:
-q, --quick skip checking the required library
action:
TEXTFILE read the text file (default: stdin)
-l, --list show the list of voices and exit
-h, --help show this help and exit
voice selection:
-n NAME, --name NAME get a voice object by name (default: Arnold)
-v NUMBER, --voice NUMBER
get a voice object by number (see --list)
-f KEY=VAL, --filter KEY=VAL
filter voices by labels (default: "use case=narration")
this option can be used multiple times
filtering will be disabled if the first -f has no "=" (e.g. -f "any")
output:
-s FILE, --save FILE save the TTS to a file (default: audio.mp3)
-p, --play play the TTS with ffplay
```
* examples: add speak_with_file()
as suggested in the review
* talk and talk-llama: ignore to_speak.txt
2024-02-24 16:24:47 +09:00
|
|
|
from elevenlabs import voices, generate, play, save
|
|
|
|
|
|
|
|
if args.filter and "=" in args.filter[0]:
|
|
|
|
voicelist = voices()
|
|
|
|
for f in args.filter:
|
|
|
|
label, value = f.split("=")
|
|
|
|
voicelist = filter(lambda x: x.labels.get(label) == value, voicelist)
|
|
|
|
voicelist = list(voicelist)
|
|
|
|
else:
|
|
|
|
voicelist = list(voices())
|
|
|
|
|
|
|
|
if args.list:
|
|
|
|
for i, v in enumerate(voicelist):
|
|
|
|
print(str(i) + ": " + v.name + " " + str(v.labels))
|
|
|
|
sys.exit()
|
|
|
|
|
|
|
|
if args.voice:
|
|
|
|
voice = voicelist[args.voice % len(voicelist)]
|
|
|
|
else:
|
|
|
|
voice = args.name
|
|
|
|
# if -n should consult -f, use the following
|
|
|
|
#voice = next(x for x in voicelist if x.name == args.name)
|
2023-04-14 18:53:58 +02:00
|
|
|
|
2023-05-24 20:11:01 +02:00
|
|
|
audio = generate(
|
talk, talk-llama : pass text_to_speak as a file (#1865)
* talk-llama: pass file instead of arg
it is too hard to quote text in a portable way
* talk-llama: pass heard_ok as a file
* talk-llama: let eleven-labs.py accept options
Options: -v voice, -s savefile, -p (--play)
* talk-llama: check installed commands in "speak"
Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed
* talk-llama: pass voice_id again
in order to sync talk with talk-llama
* talk: sync with talk-llama
Passing text_to_speak as a file is safer and more portable
cf. https://stackoverflow.com/a/59036879/45375
* talk and talk-llama: get all installed voices in speak.ps1
* talk and talk-llama: get voices from api
* talk and talk-llama: add more options to eleven-labs.py
and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/)
```
usage: eleven-labs.py [-q] [-l] [-h] [-n NAME | -v NUMBER] [-f KEY=VAL] [-s FILE | -p] [TEXTFILE]
options:
-q, --quick skip checking the required library
action:
TEXTFILE read the text file (default: stdin)
-l, --list show the list of voices and exit
-h, --help show this help and exit
voice selection:
-n NAME, --name NAME get a voice object by name (default: Arnold)
-v NUMBER, --voice NUMBER
get a voice object by number (see --list)
-f KEY=VAL, --filter KEY=VAL
filter voices by labels (default: "use case=narration")
this option can be used multiple times
filtering will be disabled if the first -f has no "=" (e.g. -f "any")
output:
-s FILE, --save FILE save the TTS to a file (default: audio.mp3)
-p, --play play the TTS with ffplay
```
* examples: add speak_with_file()
as suggested in the review
* talk and talk-llama: ignore to_speak.txt
2024-02-24 16:24:47 +09:00
|
|
|
text=str(args.inputfile.read()),
|
|
|
|
voice=voice
|
2023-05-24 20:11:01 +02:00
|
|
|
)
|
talk, talk-llama : pass text_to_speak as a file (#1865)
* talk-llama: pass file instead of arg
it is too hard to quote text in a portable way
* talk-llama: pass heard_ok as a file
* talk-llama: let eleven-labs.py accept options
Options: -v voice, -s savefile, -p (--play)
* talk-llama: check installed commands in "speak"
Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed
* talk-llama: pass voice_id again
in order to sync talk with talk-llama
* talk: sync with talk-llama
Passing text_to_speak as a file is safer and more portable
cf. https://stackoverflow.com/a/59036879/45375
* talk and talk-llama: get all installed voices in speak.ps1
* talk and talk-llama: get voices from api
* talk and talk-llama: add more options to eleven-labs.py
and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/)
```
usage: eleven-labs.py [-q] [-l] [-h] [-n NAME | -v NUMBER] [-f KEY=VAL] [-s FILE | -p] [TEXTFILE]
options:
-q, --quick skip checking the required library
action:
TEXTFILE read the text file (default: stdin)
-l, --list show the list of voices and exit
-h, --help show this help and exit
voice selection:
-n NAME, --name NAME get a voice object by name (default: Arnold)
-v NUMBER, --voice NUMBER
get a voice object by number (see --list)
-f KEY=VAL, --filter KEY=VAL
filter voices by labels (default: "use case=narration")
this option can be used multiple times
filtering will be disabled if the first -f has no "=" (e.g. -f "any")
output:
-s FILE, --save FILE save the TTS to a file (default: audio.mp3)
-p, --play play the TTS with ffplay
```
* examples: add speak_with_file()
as suggested in the review
* talk and talk-llama: ignore to_speak.txt
2024-02-24 16:24:47 +09:00
|
|
|
if args.play:
|
|
|
|
play(audio)
|
|
|
|
else:
|
|
|
|
save(audio, args.save)
|