
Speech-driven Hand Gesture Generation Demo

This repository can be used to reproduce the results of applying our model to the English dataset.

If you want to learn more about the model, this video is a good starting point.

An example of the generated motion can be seen in the demo video.

Requirements

  • Python 3
  • ffmpeg (to visualize the results; you can verify your installation as shown below)
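The visualization step expects ffmpeg to be available on your PATH. You can quickly check that it is installed:

ffmpeg -version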

Install dependencies

pip install --upgrade pip
pip install -r requirements.txt
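If you prefer to keep the demo's dependencies isolated from your system Python, you can install them into a virtual environment first. This is an optional sketch; the scripts do not require it:

python3 -m venv venv             # create an isolated environment
source venv/bin/activate         # activate it for the current shell
pip install --upgrade pip
pip install -r requirements.txt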

Usage

./generate.sh data/audio*.wav

In place of audio*.wav you can use any file from the data folder, which contains chunks of the test sequences. Alternatively, you can download more audio files for testing from the Trinity Speech-Gesture dataset. (The recordings 'NaturalTalking_01.wav' and 'NaturalTalking_02.wav' were not used in training and were held out for testing.)
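To process all of the provided chunks in one run, you can loop over the folder. This is a minimal sketch, assuming the default data layout and that generate.sh is invoked once per file:

for f in data/*.wav; do
  ./generate.sh "$f"    # generate motion (and a visualization) for each audio chunk
done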

Training on your own data

To train the model on your own data, we refer you to the original repository, which contains the official implementation of the paper.

Citation

Here is the citation for our paper in BibTeX format:

@article{kucherenko2021moving,
  author    = {Taras Kucherenko and Dai Hasegawa and Naoshi Kaneko and Gustav Eje Henter and Hedvig Kjellström},
  title     = {Moving Fast and Slow: Analysis of Representations and Post-Processing in Speech-Driven Automatic Gesture Generation},
  journal   = {International Journal of Human--Computer Interaction},
  volume    = {37},
  number    = {14},
  pages     = {1300--1316},
  year      = {2021},
  publisher = {Taylor \& Francis},
  doi       = {10.1080/10447318.2021.1883883},
  url       = {https://doi.org/10.1080/10447318.2021.1883883},
  eprint    = {https://doi.org/10.1080/10447318.2021.1883883}
}

If you are going to use the Trinity Speech-Gesture dataset, please don't forget to cite it as described on their website.

Contact

If you encounter any problems, bugs, or issues, please contact me on GitHub or by email at [email protected]. I prefer questions and bug reports on GitHub, as that provides visibility to others who might be encountering the same issues or have the same questions.