Wav2Vec2FBX

Generate an FBX of a phoneme lip-sync animation from an sigle audio file, using Wav2Vec2 to analyze the phonemes for helps the animators starts with very basic animation.

Installation

Environment

Virtual environment, Python 3.7 is highly recommended, as it is supported by the FBX Python SDK.

Clone repository

git clone https://github.com/yamahigashi/Wav2Vec2FBX.git
cd Wav2Vec2FBX
pip install requirements.txt

FBX python SDK

https://www.autodesk.com/developer-network/platform-technologies/fbx-sdk-2020-0 download FBX SDK from autodesk and place libraries (fbx.pyd, FbxCommon.py and fbxsip.pyd) into lib folder.

Run

python main.py input_audio.wav

This will generate input_audio.fbx in the same folder as the input file.

Configuration

The behaviour can be changed by the configuration file assets/config.toml.

keyframes settings

[keyframes]
# ipa と無口を補完するフレーム
interpolation = 5

# 複数口形素からなる ipa を補完するフレーム
consecutive_viseme_frame = 3

audio settings section

Describes settings for preprocessing an audio file. It splits the file based on the silence, and if it is still too long, splits the file based on the settings.

[audio_settings]

# 無音期間を判定する際の最小ミリセク  (初期値 500)
min_silence_len_ms = 500

# 無音判定 (初期値 -36)
silence_thresh_db = -36

# 最長オーディオファイル。これ以上は複数に分割して処理 (初期値 5000)
maximum_duration_ms = 5000

ipa to arpabet table settings

The phonemes to morphemes correspondence table. The phonemes determined by Wav2Vec are mapped to oral morphemes. The list of morphonemes can be given as.

[ipa_to_arpabet]
'ɔ'      = ["a"]
'ɑ'     = ["a"]
'i'      = ["i"]
# Long Vowels
'e ː'   = ["e", "e"]
'o ː'   = ["o", "o"]

# -------- snip --------------

Build binary using cx_Freeze

You can deploy this package as binary for the environment without python using cx_Freeze.

python setup.py build

This will generate binary for your platform.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
assets		assets
lib		lib
src/Wav2Vec2FBX		src/Wav2Vec2FBX
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
mypy.ini		mypy.ini
pylintrc		pylintrc
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py
tox.ini		tox.ini

License

yamahigashi/Wav2Vec2FBX

Folders and files

Latest commit

History

Repository files navigation

Wav2Vec2FBX

Table of contents

Installation

Environment

Clone repository

FBX python SDK

Run

Configuration

keyframes settings

audio settings section

ipa to arpabet table settings

Build binary using cx_Freeze

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages