Disclaimer from the Voicecraft Github repo

Any organization or individual is prohibited from using any technology mentioned in this paper to generate or edit someone's speech without his/her consent, including but not limited to government leaders, political figures, and celebrities. If you do not comply with this item, you could be in violation of copyright laws.

Description

A dockerized version of VoiceCraft [CUDA only] offering a gradio interface voicecraft github and inspired by this webio implementation.

# docker build -t voicecrafter
# docker run --gpus=all -p 7860:7860 -it voicecrafter

Screenshot

Instructions

Run the above to start your instance
Visit the gradio.live or the local link (note: not currently authenticated so anyone with the link can use it)
Click the "Original Audio" tile to upload clear audio of only the subject speaking on the order of 5-10 seconds. Trim out anything longer and choose audio with no background noise or crackles and pops (file formats: mp3, m4a, wav)
Update the "original_transcript" with the transcript of the audio uploaded or leave the Autotranscribe input checkbox checked
Update "target_transcript" with the sentence or two of text you want to generate
Click "Run" to generate audio
Click the play button next to "Generated Audio" to hear the clip and the "..." to download

Models

Model	Parameters	Memory	Runs on
fast-whisper			CPU
voicecraft	330M	4GB+ VRAM	GPU
voicecraft	830M	6GB+ VRAM	GPU

Original VoiceCraft License

The codebase is under CC BY-NC-SA 4.0 (LICENSE-CODE), and the model weights are under Coqui Public Model License 1.0.0 (LICENSE-MODEL). Note that we use some of the code from other repository that are under different licenses: ./models/codebooks_patterns.py is under MIT license; ./models/modules, ./steps/optim.py, data/tokenizer.py are under Apache License, Version 2.0; the phonemizer we used is under GNU 3.0 License.

Please refer to the below for latest:

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
download_models.py		download_models.py
requirements-frozen.txt		requirements-frozen.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dockerfile

Dockerfile

LICENSE

LICENSE

README.md

README.md

app.py

app.py

download_models.py

download_models.py

requirements-frozen.txt

requirements-frozen.txt

Repository files navigation

Disclaimer from the Voicecraft Github repo

Description

Screenshot

Instructions

Models

Original VoiceCraft License

About

Releases

Packages

Languages

License

pselvana/VoiceCrafter

Folders and files

Latest commit

History

Repository files navigation

Disclaimer from the Voicecraft Github repo

Description

Screenshot

Instructions

Models

Original VoiceCraft License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages