UniCon3R: Unified Contact-aware 4D Human-Scene Reconstruction from Monocular Video

Tanuj Sur; Shashank Tripathi; Nikos Athanasiou; Ha Linh Nguyen; Kai Xu; Michael J. Black; Angela Yao

arXiv:2604.19923 · cs.CV · May 12, 2026

TL;DR

UniCon3R is a feed-forward framework for online, contact-aware 4D human-scene reconstruction from monocular video. It models contact between the human and the scene to reduce artifacts such as bodies floating above the ground or penetrating the scene.

Contribution

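It introduces a contact inference mechanism: 4D contact is inferred from the human pose and scene geometry, then used as a corrective cue when generating the pose. This lets the model jointly recover scene geometry and spatially aligned 4D humans in a feed-forward manner.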

Findings

01 Outperforms state-of-the-art methods on human motion estimation.

02 Improves physical plausibility by modeling contact interactions.

03 Maintains fast, feed-forward inference speeds.

Abstract

We introduce UniCon3R, a unified feed-forward framework for online human-scene 4D reconstruction from monocular video. Current feed-forward human-scene reconstruction methods suffer from artifacts, where bodies float above the ground or penetrate parts of the scene. A key reason is the lack of effective interaction modelling between the human and the environment. Our goal is to exploit contact between the human and the scene during inference to actively improve the human mesh reconstruction. To that end, we explicitly model interaction by inferring 4D contact from the human pose and scene geometry and use the contact as a corrective cue for generating the pose. This enables UniCon3R to jointly recover scene geometry and spatially aligned 4D humans within the scene. Experiments on standard human-centric video benchmarks show that UniCon3R outperforms state-of-the-art baselines on…
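To make the floating and penetration artifacts described in the abstract concrete, the following is a minimal sketch of contact used as a corrective cue. It is not UniCon3R's method: the paper infers contact with a learned model from pose and scene geometry, whereas this toy uses a hand-written distance threshold against a flat ground plane. The function names, the 2 cm threshold, the y-up convention, and the assumption that the person is standing on the ground are all hypothetical choices made for illustration.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]  # (x, y, z), with y pointing up (assumed convention)


def signed_height(p: Point, ground_y: float = 0.0) -> float:
    """Height of a point above the ground plane; negative means penetration."""
    return p[1] - ground_y


def infer_contact(vertices: List[Point], ground_y: float = 0.0,
                  threshold: float = 0.02) -> List[bool]:
    """Toy contact labels: a vertex is 'in contact' if it lies within
    `threshold` metres of the ground plane (hypothetical heuristic)."""
    return [abs(signed_height(v, ground_y)) <= threshold for v in vertices]


def correct_vertical_offset(vertices: List[Point],
                            ground_y: float = 0.0) -> List[Point]:
    """Shift the whole body vertically so its lowest vertex rests on the
    ground. This removes both floating (lowest > 0) and penetration
    (lowest < 0), assuming the person is actually standing on the ground."""
    lowest = min(signed_height(v, ground_y) for v in vertices)
    return [(x, y - lowest, z) for (x, y, z) in vertices]


# A two-vertex "body" (foot and head) floating 10 cm above the ground.
floating = [(0.0, 0.10, 0.0), (0.0, 1.00, 0.0)]
# The same body sunk 5 cm into the ground.
penetrating = [(0.0, -0.05, 0.0), (0.0, 0.85, 0.0)]

fixed_floating = correct_vertical_offset(floating)
fixed_penetrating = correct_vertical_offset(penetrating)
```

After correction, the foot vertex in both examples sits on the ground plane and is labelled as in contact, while the head vertex is not. A learned approach can go further than this heuristic, for example by handling non-planar scene geometry and contact on body parts other than the feet.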

Code & Models

Repositories

https://surtantheta.github.io/UniCon3R
github
