An easy-to-use speech toolkit that includes self-supervised learning models, SOTA and streaming ASR with punctuation, streaming TTS with a text frontend, a speaker verification system, end-to-end speech translation, and keyword spotting. Won the NAACL 2022 Best Demo Award.
tts
speech-synthesis
transformer
voice-recognition
speech-recognition
whisper
asr
vocoder
conformer
sound-classification
kws
self-supervised-learning
code-switch
voice-cloning
speech-translation
punctuation-restoration
wav2vec2
streaming-asr
speech-alignment
streaming-tts
Updated Apr 16, 2024 - Python