Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning

Buzhen Huang; Chen Li; Chongyang Xu; Dongyue Lu; Jinnan Chen; Yangang Wang; Gim Hee Lee

arXiv:2507.02565·cs.CV·July 4, 2025

Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning

Buzhen Huang, Chen Li, Chongyang Xu, Dongyue Lu, Jinnan Chen, Yangang Wang, Gim Hee Lee

PDF

TL;DR

This paper introduces a dual-branch optimization framework leveraging human appearance and proxemics to improve the reconstruction of close human interactions in challenging in-the-wild videos, surpassing existing methods.

Contribution

It proposes a novel framework that combines appearance cues, social proxemics, and physical constraints with a diffusion model to accurately reconstruct interactive human motions.

Findings

01

Outperforms existing pose estimation methods on several benchmarks.

02

Successfully reconstructs plausible close interactions in complex environments.

03

Builds a new dataset with pseudo ground-truth annotations for future research.

Abstract

Due to visual ambiguities and inter-person occlusions, existing human pose estimation methods cannot recover plausible close interactions from in-the-wild videos. Even state-of-the-art large foundation models~(\eg, SAM) cannot accurately distinguish human semantics in such challenging scenarios. In this work, we find that human appearance can provide a straightforward cue to address these obstacles. Based on this observation, we propose a dual-branch optimization framework to reconstruct accurate interactive motions with plausible body contacts constrained by human appearances, social proxemics, and physical laws. Specifically, we first train a diffusion model to learn the human proxemic behavior and pose prior knowledge. The trained network and two optimizable tensors are then incorporated into a dual-branch optimization framework to reconstruct human motions and appearances. Several…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion