OmniEgoCap: Camera-Agnostic Sequence-Level Egocentric Motion Reconstruction
Kyungwon Cho, Hanbyul Joo

TL;DR
OmniEgoCap introduces a sequence-level diffusion framework for egocentric motion reconstruction that generalizes across diverse hardware setups and captures global body attributes.
Contribution
It proposes a unified, hardware-agnostic approach using sequence-level inference and geometry-aware augmentation for natural, consistent 3D motion reconstruction.
Findings
Achieves state-of-the-art results on public benchmarks.
Demonstrates robustness across diverse in-the-wild environments.
Effectively recovers invariant physical attributes like height and body proportions.
Abstract
The proliferation of commercial egocentric devices offers a unique lens into human behavior, yet reconstructing full-body 3D motion remains difficult due to frequent self-occlusion and the 'out-of-sight' nature of the wearer's limbs. While head and hand trajectories provide sparse anchor points, current methods often overfit to specific hardware optics or rely on expensive, post-hoc optimizations that compromise motion naturalness. In this paper, we present OmniEgoCap, a unified diffusion framework that scales egocentric reconstruction to diverse capture setups. By shifting from short-term windowed estimation to sequence-level inference, our method captures a global perspective and recovers invariant physical attributes, such as height and body proportions, that provide critical constraints for disambiguating head-only cues. To ensure hardware-agnostic generalization, we introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
