joint training for ASR rnn_attention based model and conformer framework

The repository implements an ASR framework. We proposed a joint training network that contains an enhancement model a Fbank feature extraction model an End-to-End ASR model and a discriminant model. For comparson purpose we implement two End-to-End ASR models, RNN-Attention based model and conformer model.

Requirements

python 3,5 Pytorch 0.4.0

Data

For evaluation the performance of the model, we use AISHELL-1 to train and test the model, which is a Mandarin corpus. You can download the AISHELL-1 online.
For training the enhancement network, we use NOISE-92 as the background noise.
For language model training, we use a pretrained open source chinese embeding vectors as our initial embeding vectors in the language model. You can also use your own dataset. But the dataset must be splited into three parts train dev and test.

Framework

The input data of the enhancement model is 257 dimension STFT feature. Afterwards, the enhancement network will estimate a mask, which has the same size as the input data. The estimated clean signal is computed by element multipling the input data and the mask.
The End-to-End network estimate the posteriori probabilites for ouptut sequence.
In the joint training, an additional discriminant network is connected with the enhancement network, and can guide the enhancement network training towards true clean signal.

run the framework

you can run run_att_model.sh or run_conformer_model to train and test the model. The difference is different End-to-End network, RNN-Attention network or conformer model.
The configrations for training and test stages are saved .yml files in config folder.
You must change some path directions in run_att_model.sh and run_conformer_model.

Results

clean speech test set for RNN-attention network:

|------------------------------------------------------------------------------|
| SPKR    |  # Snt   # Wrd  |  Corr     Sub      Del     Ins      Err   S.Err  |
|---------+-----------------+--------------------------------------------------|
| Sum/Avg |  7176   104765  |  86.6    13.0      0.5     0.5     14.0    67.5  |
|==============================================================================|

noise speech test set for RNN-attention network:

|------------------------------------------------------------------------------|
| SPKR    |  # Snt   # Wrd  |  Corr     Sub      Del     Ins      Err   S.Err  |
|---------+-----------------+--------------------------------------------------|
| Sum/Avg |  7176  104765  |  76.9      22.1     0.9      0.7    23.7    77.8  |
|==============================================================================|

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
__pycache__		__pycache__
config		config
conformer		conformer
conformer_options		conformer_options
data		data
images		images
model		model
options		options
transformer		transformer
utils		utils
venv		venv
README.md		README.md
asr_recog.py		asr_recog.py
asr_recog_conf.py		asr_recog_conf.py
asr_train.py		asr_train.py
asr_train_conf.py		asr_train_conf.py
cmd.sh		cmd.sh
draw_loss.ipynb		draw_loss.ipynb
e2e_asr_conformer.py		e2e_asr_conformer.py
enhance_base_train.py		enhance_base_train.py
enhance_fbank_train.py		enhance_fbank_train.py
enhance_gan_train.py		enhance_gan_train.py
enhance_out.py		enhance_out.py
fake_opt.py		fake_opt.py
fix_data.sh		fix_data.sh
joint_recog.py		joint_recog.py
joint_recog_conformer.py		joint_recog_conformer.py
joint_train.py		joint_train.py
joint_train_conformer.py		joint_train_conformer.py
lm_train.py		lm_train.py
path.sh		path.sh
recog_att.sh		recog_att.sh
recog_conformer.sh		recog_conformer.sh
rewav.py		rewav.py
run_att_model.sh		run_att_model.sh
run_conformer_model.sh		run_conformer_model.sh
trans_train.py		trans_train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

joint training for ASR rnn_attention based model and conformer framework

Requirements

Data

Framework

run the framework

Results

About

Releases

Packages

Languages

AlbertSebastain/e2e_conformer

Folders and files

Latest commit

History

Repository files navigation

joint training for ASR rnn_attention based model and conformer framework

Requirements

Data

Framework

run the framework

Results

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages