GitHub - jetfontanilla/win-sapi-tts-audio-file-generator: using Win SpVoice Interface (SAPI) with python to generate audio files with either text-to-speech or phoneme-to-speech

Windows SAPI Text-to-Speech / Phoneme-to-Speech Audio File Generator

using Win SpVoice Interface (SAPI) with python to generate audio files

Configuration

edit config.ini to change to a different Voice Bank, or change other TTS parameters

Generating Audio

create a CSV file with the unique ID as the first parameter and the transcript to run as the second. it also supports phoneme-based text-to-speech by adding the phonemes in the 3rd column (link for supported phonemes) if phonemes are provided, it will ignore the transcript parameter

to generate the audio files, run this command

$ python generator.py

generated audio files will be in the output folder

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
output		output
tts		tts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.ini		config.ini
generate.py		generate.py
transcripts.csv		transcripts.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

output

output

tts

tts

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

config.ini

config.ini

generate.py

generate.py

transcripts.csv

transcripts.csv

Repository files navigation

Windows SAPI Text-to-Speech / Phoneme-to-Speech Audio File Generator

Configuration

Generating Audio

About

Releases

Packages

Languages

License

jetfontanilla/win-sapi-tts-audio-file-generator

Folders and files

Latest commit

History

Repository files navigation

Windows SAPI Text-to-Speech / Phoneme-to-Speech Audio File Generator

Configuration

Generating Audio

About

Topics

Resources

License

Stars

Watchers

Forks

Languages