OmniEgoCap: Camera-Agnostic Sequence-Level Egocentric Motion Reconstruction

Kyungwon Cho; Hanbyul Joo

arXiv:2512.19283·cs.CV·April 2, 2026

OmniEgoCap: Camera-Agnostic Sequence-Level Egocentric Motion Reconstruction

Kyungwon Cho, Hanbyul Joo

PDF

TL;DR

OmniEgoCap introduces a sequence-level diffusion framework for egocentric motion reconstruction that generalizes across diverse hardware setups and captures global body attributes.

Contribution

It proposes a unified, hardware-agnostic approach using sequence-level inference and geometry-aware augmentation for natural, consistent 3D motion reconstruction.

Findings

01

Achieves state-of-the-art results on public benchmarks.

02

Demonstrates robustness across diverse in-the-wild environments.

03

Effectively recovers invariant physical attributes like height and body proportions.

Abstract

The proliferation of commercial egocentric devices offers a unique lens into human behavior, yet reconstructing full-body 3D motion remains difficult due to frequent self-occlusion and the 'out-of-sight' nature of the wearer's limbs. While head and hand trajectories provide sparse anchor points, current methods often overfit to specific hardware optics or rely on expensive, post-hoc optimizations that compromise motion naturalness. In this paper, we present OmniEgoCap, a unified diffusion framework that scales egocentric reconstruction to diverse capture setups. By shifting from short-term windowed estimation to sequence-level inference, our method captures a global perspective and recovers invariant physical attributes, such as height and body proportions, that provide critical constraints for disambiguating head-only cues. To ensure hardware-agnostic generalization, we introduce a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.