TL;DR
HandX introduces a comprehensive dataset, annotation strategy, and benchmarks for realistic bimanual hand motion synthesis, emphasizing fine-grained finger dynamics and coordination.
Contribution
The paper presents a new high-quality bimanual motion dataset, a scalable annotation method using language models, and benchmarks for dexterous motion generation.
Findings
Larger models trained on bigger datasets produce more coherent bimanual motion.
The proposed metrics effectively evaluate dexterous hand motion quality.
The dataset supports future research in realistic hand interaction synthesis.
Abstract
Synthesizing human motion has advanced rapidly, yet realistic hand motion and bimanual interaction remain underexplored. Whole-body models often miss the fine-grained cues that drive dexterous behavior, finger articulation, contact timing, and inter-hand coordination, and existing resources lack high-fidelity bimanual sequences that capture nuanced finger dynamics and collaboration. To fill this gap, we present HandX, a unified foundation spanning data, annotation, and evaluation. We consolidate and filter existing datasets for quality, and collect a new motion-capture dataset targeting underrepresented bimanual interactions with detailed finger dynamics. For scalable annotation, we introduce a decoupled strategy that extracts representative motion features, e.g., contact events and finger flexion, and then leverages reasoning from large language models to produce fine-grained,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
