202-People-Multi-angle-Lip-Multimodal-Video-Data

Description

202 People - Multi-angle Lip Multimodal Video Data. The collection environments include indoor natural light scenes and indoor fluorescent lamp scenes. The device is cellphone. The diversity includes multiple scenes, different ages, 13 shooting angles. The language is Mandarin Chinese. The recording content is general field, unlimited content. The data can be used in multi-modal learning algorithms research in speech and image fields.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1298?source=Github

Specifications

Data size

202 people, each person collects the audio and video data from 13 different angles +1 txt document

People distribution

race distribution: Asian (Indonesia), gender distribution: 89 males, 113 females, age distribution: 165 people aged 18-30, 32 people aged 31-45, and 5 people aged 46-60

Collecting environment

indoor natural light scenes, indoor fluorescent lamp scenes

Data diversity

including multiple scenes, different ages, different shooting angles

Device

cellphone, the resolution is 1,920*1,080

Collecting angle

audio and video data of front face, 3 angles left side face, 3 angles right side face, looking down, looking up, left side face down, right side face down, left side face up and right side face up all 13 different angles were collected at the same time

Recording content

general field, unlimited content

Language

Mandarin Chinese, each video is more than 20 seconds

Data format

the video data format is .mp4, the audio is greater than or equal to 16KHz, 16bit, the frame rate is 25-30 fps

Accuracy rata

the accuracy rate of sentence is more than 95%

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
001_female_30.png		001_female_30.png
002_male_29.png		002_male_29.png
040_male_21.png		040_male_21.png
090_female_38.png		090_female_38.png
156_male_42.png		156_male_42.png
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

001_female_30.png

001_female_30.png

002_male_29.png

002_male_29.png

040_male_21.png

040_male_21.png

090_female_38.png

090_female_38.png

156_male_42.png

156_male_42.png

README.md

README.md

Repository files navigation

202-People-Multi-angle-Lip-Multimodal-Video-Data

Description

Specifications

Data size

People distribution

Collecting environment

Data diversity

Device

Collecting angle

Recording content

Language

Data format

Accuracy rata

Licensing Information

About

Releases

Packages

Nexdata-AI/202-People-Multi-angle-Lip-Multimodal-Video-Data

Folders and files

Latest commit

History

Repository files navigation

202-People-Multi-angle-Lip-Multimodal-Video-Data

Description

Specifications

Data size

People distribution

Collecting environment

Data diversity

Device

Collecting angle

Recording content

Language

Data format

Accuracy rata

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks