EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding
Yuan-Ming Li, Wei-Jin Huang, An-Lan Wang, Ling-An Zeng, Jing-Ke Meng, and Wei-Shi Zheng

TL;DR
EgoExo-Fitness introduces a comprehensive dataset with synchronized egocentric and exocentric videos, rich annotations, and benchmarks for full-body action understanding, enabling advances in multi-view action analysis and interpretation.
Contribution
The paper presents a novel dataset with synchronized multi-view videos, detailed annotations, and benchmarks for egocentric and exocentric full-body action understanding, including new interpretability and quality assessment tasks.
Findings
New dataset with rich annotations and synchronized views
Benchmarks for multiple action understanding tasks
Analysis demonstrating dataset utility and potential applications
Abstract
We present EgoExo-Fitness, a new full-body action understanding dataset, featuring fitness sequence videos recorded from synchronized egocentric and fixed exocentric (third-person) cameras. Compared with existing full-body action understanding datasets, EgoExo-Fitness not only contains videos from first-person perspectives, but also provides rich annotations. Specifically, two-level temporal boundaries are provided to localize single action videos along with sub-steps of each action. More importantly, EgoExo-Fitness introduces innovative annotations for interpretable action judgement--including technical keypoint verification, natural language comments on action execution, and action quality scores. Combining all of these, EgoExo-Fitness provides new resources to study egocentric and exocentric full-body action understanding across dimensions of "what", "when", and "how well". To…
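The annotation layers described in the abstract (two-level temporal boundaries, technical keypoint verification, natural language comments, and action quality scores) can be pictured as a nested record. The following is a minimal sketch under stated assumptions, not the dataset's actual schema: every class name, field name, label, and unit here is hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Segment:
    """A labeled temporal span. Seconds are an assumed unit."""
    start: float
    end: float
    label: str

    def duration(self) -> float:
        return self.end - self.start


@dataclass
class KeypointCheck:
    """One technical keypoint and whether the execution satisfied it."""
    description: str
    satisfied: bool


@dataclass
class ActionAnnotation:
    """Hypothetical container for one annotated action (not the real schema)."""
    action: Segment                                         # first level: the single action
    substeps: List[Segment] = field(default_factory=list)   # second level: sub-steps
    keypoints: List[KeypointCheck] = field(default_factory=list)
    comment: str = ""                                       # natural language comment
    quality_score: Optional[float] = None                   # action quality score

    def substeps_within_action(self) -> bool:
        """True if every sub-step lies inside the action's boundaries."""
        return all(
            self.action.start <= s.start <= s.end <= self.action.end
            for s in self.substeps
        )

    def keypoint_pass_rate(self) -> Optional[float]:
        """Fraction of keypoints satisfied, or None if none were annotated."""
        if not self.keypoints:
            return None
        passed = sum(1 for k in self.keypoints if k.satisfied)
        return passed / len(self.keypoints)


# Placeholder values, invented purely to exercise the structure.
example = ActionAnnotation(
    action=Segment(10.0, 25.0, "example_action"),
    substeps=[
        Segment(10.0, 16.0, "substep_1"),
        Segment(16.0, 25.0, "substep_2"),
    ],
    keypoints=[
        KeypointCheck("placeholder keypoint A", True),
        KeypointCheck("placeholder keypoint B", False),
    ],
    comment="placeholder comment on execution",
    quality_score=3.5,
)
```

In this sketch the "when" dimension lives in the two `Segment` levels, the "what" in the labels, and the "how well" in the keypoint checks, comment, and score. A helper such as `substeps_within_action()` is the kind of sanity check one might run before using two-level boundaries for localization.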
Taxonomy
Topics: Human Pose and Action Recognition · Action Observation and Synchronization
