ziadasem/generate_audio_using_vae

Generating Sound

In this repository, audio files from the FSDD (Free Spoken Digit Dataset) are run through a preprocessing pipeline (see Audio Signal Processing for ML) and used to train a Variational Autoencoder (VAE). The trained model generates new audio, which is written to the /Audio directory.
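
As a rough illustration of the steps described above, the sketch below shows two pieces that such a pipeline typically needs: bringing every clip to a fixed length and scaling it into a known range (keeping the min and max so generated output can be mapped back), and sampling a latent vector with the reparameterisation trick. This is not the code from this repository; every function name and parameter here (`pad_to_length`, `min_max_normalise`, `denormalise`, `sample_latent`) is hypothetical, and plain Python lists stand in for the arrays a real implementation would use.

```python
import math
import random


def pad_to_length(samples, target_len):
    """Right-pad with zeros (or truncate) so every clip has the same length."""
    clipped = list(samples[:target_len])
    return clipped + [0.0] * (target_len - len(clipped))


def min_max_normalise(values, low=0.0, high=1.0):
    """Scale values into [low, high]; also return the original min and max."""
    v_min, v_max = min(values), max(values)
    span = (v_max - v_min) or 1.0  # avoid division by zero for constant input
    scaled = [low + (v - v_min) / span * (high - low) for v in values]
    return scaled, v_min, v_max


def denormalise(scaled, v_min, v_max, low=0.0, high=1.0):
    """Invert min_max_normalise using the stored min and max."""
    return [v_min + (s - low) / (high - low) * (v_max - v_min) for s in scaled]


def sample_latent(mu, log_var, rng=random):
    """Reparameterisation trick: z = mu + sigma * eps, with eps ~ N(0, 1)."""
    return [
        m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
        for m, lv in zip(mu, log_var)
    ]


if __name__ == "__main__":
    clip = [0.5, -1.0, 0.25]                 # toy stand-in for an audio clip
    padded = pad_to_length(clip, 5)
    scaled, v_min, v_max = min_max_normalise(padded)
    restored = denormalise(scaled, v_min, v_max)
    z = sample_latent([0.0, 0.0], [0.0, 0.0], random.Random(0))
    print(padded, scaled, restored, z)
```

Storing the min and max per clip matters because the decoder produces values in the normalised range; without them the generated signal cannot be rescaled to its original amplitude before being saved as audio.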

Some Notes:

  • This repo is a demo only, so the quality of the output audio is limited.
  • This repo was not originally written with publication in mind, so the code may be disorganized in places; it will be restructured later.
