
using Win SpVoice Interface (SAPI) with python to generate audio files with either text-to-speech or phoneme-to-speech

jetfontanilla/win-sapi-tts-audio-file-generator


Windows SAPI Text-to-Speech / Phoneme-to-Speech Audio File Generator

Uses the Windows SAPI SpVoice interface from Python to generate audio files.

Configuration

Edit config.ini to switch to a different voice bank or to change other TTS parameters.
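As a rough illustration of how such settings could be read, here is a minimal sketch using Python's standard `configparser`. The section name `tts` and the keys `voice`, `rate`, and `volume` are assumptions made for this example; check the actual config.ini for the names it really uses.

```python
import configparser

# Hypothetical config.ini contents; the real file's section and key names may differ.
SAMPLE_CONFIG = """
[tts]
voice = Microsoft Zira Desktop
rate = 0
volume = 100
"""


def load_tts_settings(text):
    """Parse INI text and return the TTS settings as a plain dict."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    section = parser["tts"]
    return {
        "voice": section.get("voice"),
        # SpVoice accepts a Rate from -10 to 10 and a Volume from 0 to 100.
        "rate": section.getint("rate", fallback=0),
        "volume": section.getint("volume", fallback=100),
    }


settings = load_tts_settings(SAMPLE_CONFIG)
print(settings)
```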

Generating Audio

Create a CSV file with the unique ID in the first column and the transcript to speak in the second. Phoneme-based speech is also supported: add the phonemes in the third column (link for supported phonemes). If phonemes are provided, the transcript column is ignored.
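The column layout described above can be sketched as follows. This is an illustration, not the code in generator.py: the sample rows, the function names, and the use of the SAPI XML `<pron sym="..."/>` tag to pass phonemes to the voice are all assumptions.

```python
import csv
import io
from xml.sax.saxutils import quoteattr

# Hypothetical input: ID, transcript, optional phonemes.
SAMPLE_CSV = "001,Hello world\n002,tomato,t ax m ey t ow\n"


def read_rows(handle):
    """Yield (row_id, transcript, phonemes) tuples, skipping blank rows."""
    for row in csv.reader(handle):
        if not row or not row[0].strip():
            continue
        row_id = row[0].strip()
        transcript = row[1].strip() if len(row) > 1 else ""
        phonemes = row[2].strip() if len(row) > 2 else ""
        yield row_id, transcript, phonemes


def speech_input(transcript, phonemes):
    """Return what would be spoken: phonemes take priority over the transcript."""
    if phonemes:
        # Assumption: phonemes are wrapped in a SAPI XML pronunciation tag.
        return f"<pron sym={quoteattr(phonemes)}/>"
    return transcript


rows = list(read_rows(io.StringIO(SAMPLE_CSV)))
jobs = [(row_id, speech_input(text, phones)) for row_id, text, phones in rows]
print(jobs)
```

The second row carries phonemes, so its transcript (`tomato`) is dropped in favor of the pronunciation tag.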

To generate the audio files, run this command:

$ python generator.py

The generated audio files will be in the output folder.
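For readers curious how SpVoice can write speech to a file instead of the speakers, here is a minimal sketch using pywin32. It is not taken from generator.py: the helper names, the `.wav` extension, and naming each file after the row ID are assumptions. It requires Windows with the `pywin32` package installed.

```python
from pathlib import Path

# SpeechStreamFileMode.SSFMCreateForWrite in the SAPI automation API.
SSFM_CREATE_FOR_WRITE = 3


def wav_path_for(output_dir, row_id):
    """Build the output path for one row (hypothetical naming scheme)."""
    return Path(output_dir) / f"{row_id}.wav"


def synthesize_to_wav(text, wav_path):
    """Speak `text` through SAPI and write the audio to `wav_path`."""
    # Imported lazily because pywin32 is only available on Windows.
    import win32com.client

    voice = win32com.client.Dispatch("SAPI.SpVoice")
    stream = win32com.client.Dispatch("SAPI.SpFileStream")
    stream.Open(str(wav_path), SSFM_CREATE_FOR_WRITE)
    try:
        # Redirect the voice's audio output into the file stream.
        voice.AudioOutputStream = stream
        voice.Speak(text)
    finally:
        stream.Close()
```

On a Windows machine, a call such as `synthesize_to_wav("Hello world", wav_path_for("output", "001"))` would be expected to create `001.wav` in an existing `output` folder.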
