Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset

Motion-X: A Large-scale 3D Expressive
Whole-body Human Motion Dataset

^*Equal Contribution, ^† Corresponding Author,
¹International Digital Economy Academy (IDEA), ²Tsinghua University ³The Chinese University of Hong Kong, Shenzhen

Abstract

We propose Motion-X, a large-scale 3D expressive whole-body motion dataset.

Existing motion datasets predominantly contain body-only poses, lacking facial expressions, hand gestures, and fine-grained pose descriptions. Moreover, they are primarily collected from limited laboratory scenes with textual descriptions manually labeled, limiting their scalability. To overcome these limitations, we develop a whole-body motion and text annotation pipeline, which can automatically annotate motion from either single- or multi-view videos and provide comprehensive semantic labels for each video and fine-grained whole-body pose descriptions for each frame. This pipeline is of high precision, cost-effective, and scalable for further research.

Based on it, we construct Motion-X, which comprises 15.6M precise 3D whole-body pose annotations (i.e., SMPL-X) covering 81.1K motion sequences from massive scenes. Besides, Motion-X provides 15.6M frame-level whole-body pose descriptions and 81.1K sequence-level semantic labels.

Comprehensive experiments demonstrate the accuracy of the annotation pipeline and the significant benefit of Motion-X in enhancing expressive, diverse, and natural motion generation, as well as the 3D whole-body human mesh recovery task.

License

All data is distributed under the CC BY-NC-SA (Attribution-NonCommercial-ShareAlike) license. For the sub-datasets, although we annotate the motion-text labels with our annoation pipeline, we would ask the user to read the original license of each original dataset, and we would only provide our annotated result to the user with the approvals from the original Institution. Here we provide the link of the used assets:

BAUM, AIST++, EgoBody datasets are CC-BY 4.0 licensed.

HAA500 dataset is MIT licensed.

HuMMan dataset is under S-Lab License v1.0.

GRAB, AMASS dataset is released for academic research only and is free to researchers from educational or research institutes for non-commercial purposes.

Other data is under CC BY-SA 4.0 license.