Skip to content

ANYMS-A/Speech-Enhancement-Models

Repository files navigation

Speech enhancement models using spectrograms as features

Speech-Enhancement-Models

Speech enhancement models:MLP, Auto-encoder, GAN

Dataset

The dataset is the speech enhancment dataset built by the University of Edinburgh. DataShare.

Requirements

  • PyTorch
conda install pytorch torchvision -c pytorch
  • librosa
pip install librosa

Notes:

The audios should be sliced into pieces with equal time length. Then do Short Time Fourier Transform on them, turn them into a 2D matirx. Then we use CNN to extract features from them.

About

Speech enhancement models:MLP, Auto-encoder, GAN

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages