TL;DR
MaMi-HOI is a hierarchical framework that harmonizes global kinematic fluidity with local geometric precision to generate realistic 3D human-object interactions, addressing the challenge of geometric forgetting in diffusion models.
Contribution
The paper introduces GAPA and KHA modules to explicitly re-inject object details and align whole-body posture, improving contact accuracy and motion naturalness in HOI generation.
Findings
Achieves both natural motion and precise contact in HOI generation.
Extends capabilities to long-term tasks with complex trajectories.
Effectively bridges global navigation and high-fidelity manipulation.
Abstract
Generating realistic 3D Human-Object Interactions (HOI) is a fundamental task for applications ranging from embodied AI to virtual content creation, which requires harmonizing high-level semantic intent with strict low-level physical constraints. Existing methods excel at semantic alignment, however, they struggle to maintain precise object contact. We reveal a key finding termed \textit{Geometric Forgetting}: as diffusion model depth increases, semantic feature tend to overshadow object geometry feature, causing the model to lose its perception to object geometry. To address this, we propose MaMi-HOI, a hierarchical framework reconciling \textbf{Ma}cro-level kinematic fluidity with \textbf{Mi}cro-level spatial precision. First, to counteract geometric forgetting, we introduce the Geometry-Aware Proximity Adapter (GAPA), which explicitly re-injects dense object details to perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
