The ultimate VITS2

The idea for this repo is to implement the most comprehensive VITS2 out here.

Changelist

# Cython-version Monotonoic Alignment Search
cd monotonic_align
mkdir monotonic_align
python setup.py build_ext --inplace

Model	How to set up json file in configs	Sample of json file configuration
iSTFT-VITS2	`"istft_vits": true,` `"upsample_rates": [8,8],`	istft_vits2_base.json
MB-iSTFT-VITS2	`"subbands": 4,` `"mb_istft_vits": true,` `"upsample_rates": [4,4],`	mb_istft_vits2_base.json
MS-iSTFT-VITS2	`"subbands": 4,` `"ms_istft_vits": true,` `"upsample_rates": [4,4],`	ms_istft_vits2_base.json
Mini-iSTFT-VITS2	`"istft_vits": true,` `"upsample_rates": [8,8],` `"hidden_channels": 96,` `"n_layers": 3,`	mini_istft_vits2_base.json
Mini-MB-iSTFT-VITS2	`"subbands": 4,` `"mb_istft_vits": true,` `"upsample_rates": [4,4],` `"hidden_channels": 96,` `"n_layers": 3,` `"upsample_initial_channel": 256,`	mini_mb_istft_vits2_base.json

# train_ms.py for multi speaker
# train_l.py to use Lightning
python train_ms.py -c configs/shergin_d_vector_hfg.json -m models/test

If you have any questions regarding how to run it, contact us in Telegram

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
configs		configs
filelists		filelists
monotonic_align		monotonic_align
resources		resources
text		text
LICENSE		LICENSE
README.md		README.md
attentions.py		attentions.py
commons.py		commons.py
data_utils.py		data_utils.py
inference.ipynb		inference.ipynb
inference.py		inference.py
losses.py		losses.py
mel_processing.py		mel_processing.py
models.py		models.py
modules.py		modules.py
onnx_export.py		onnx_export.py
pqmf.py		pqmf.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
stft.py		stft.py
stft_loss.py		stft_loss.py
train.py		train.py
train_l.py		train_l.py
train_ms.py		train_ms.py
training_colab.ipynb		training_colab.ipynb
transforms.py		transforms.py
utils.py		utils.py