Occlusion-Aware Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction

Hyungjun Doh; Dong In Lee; Seunggeun Chi; Pin-Hao Huang; Kwonjoon Lee; Sangpil Kim; Karthik Ramani

arXiv:2507.08137·cs.CV·September 16, 2025

Occlusion-Aware Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction

Hyungjun Doh, Dong In Lee, Seunggeun Chi, Pin-Hao Huang, Kwonjoon Lee, Sangpil Kim, Karthik Ramani

PDF

TL;DR

This paper presents a new framework for reconstructing 3D human-object interactions from monocular videos that effectively handles occlusions and ensures temporal consistency through amodal completion and temporal integration.

Contribution

It introduces a template-free, temporally coherent amodal completion method that improves 3D reconstruction accuracy in occluded and dynamic scenes without relying on predefined models.

Findings

01

Outperforms existing methods in occlusion handling

02

Maintains temporal stability in reconstructions

03

Achieves higher precision with 3D Gaussian Splatting

Abstract

We introduce a novel framework for reconstructing dynamic human-object interactions from monocular video that overcomes challenges associated with occlusions and temporal inconsistencies. Traditional 3D reconstruction methods typically assume static objects or full visibility of dynamic subjects, leading to degraded performance when these assumptions are violated-particularly in scenarios where mutual occlusions occur. To address this, our framework leverages amodal completion to infer the complete structure of partially obscured regions. Unlike conventional approaches that operate on individual frames, our method integrates temporal context, enforcing coherence across video sequences to incrementally refine and stabilize reconstructions. This template-free strategy adapts to varying conditions without relying on predefined models, significantly enhancing the recovery of intricate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.