The first version of Yodi is built on umini_speech, a mini speech dataset developed and currently being improved by the Umbaji team. It contains more than 800 augmented recordings from various speakers, in different settings, saying the same words as in the mini speech dataset but in Ewè.
The Yodi model was developed by applying the short-time Fourier transform (STFT) to the speech waveforms, yielding spectrograms that carry more meaningful, embeddable features for training. For the audio inputs, we consider the vector space of Ewè words.
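The exact STFT parameters used for Yodi are not given here; a minimal NumPy sketch of the waveform-to-spectrogram step, assuming a hypothetical 256-sample Hann window with 50% overlap and a synthetic tone standing in for an Ewè recording:

```python
import numpy as np

def stft_magnitude(waveform, frame_length=256, frame_step=128):
    """Magnitude spectrogram via a Hann-windowed short-time Fourier transform."""
    window = np.hanning(frame_length)
    n_frames = 1 + (len(waveform) - frame_length) // frame_step
    frames = np.stack([
        waveform[i * frame_step : i * frame_step + frame_length] * window
        for i in range(n_frames)
    ])
    # rfft keeps only the non-negative frequency bins (frame_length // 2 + 1 of them)
    return np.abs(np.fft.rfft(frames, axis=-1))

# Synthetic 1-second, 16 kHz sine wave as a stand-in for a real recording
sr = 16000
t = np.arange(sr) / sr
wave = np.sin(2 * np.pi * 440.0 * t)
spec = stft_magnitude(wave)
print(spec.shape)  # (n_frames, n_freq_bins)
```

Each row of `spec` is one time frame; the per-frame magnitudes are what a model can embed more usefully than raw samples.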
For recognition, we consider the vector space of the word labels (transcripts).
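One common way to realize a label vector space is one-hot encoding over the vocabulary. A small sketch with placeholder English labels (the actual umini_speech vocabulary is Ewè and may differ):

```python
import numpy as np

# Placeholder vocabulary; the real dataset uses Ewè words
vocab = ["yes", "no", "up", "down"]
label_to_index = {word: i for i, word in enumerate(vocab)}

def one_hot(word):
    """Map a transcript label to its one-hot vector in the label space."""
    vec = np.zeros(len(vocab), dtype=np.float32)
    vec[label_to_index[word]] = 1.0
    return vec

print(one_hot("up"))  # [0. 0. 1. 0.]
```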
Consult the Yodi package here: https://github.com/Umbaji/Yodi