EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild

Yumeng Liu; Xiaoxiao Long; Zemin Yang; Yuan Liu; Marc Habermann; Christian Theobalt; Yuexin Ma; Wenping Wang

arXiv:2411.14280·cs.CV·December 22, 2025

EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild

Yumeng Liu, Xiaoxiao Long, Zemin Yang, Yuan Liu, Marc Habermann, Christian Theobalt, Yuexin Ma, Wenping Wang

PDF

Open Access 1 Repo

TL;DR

EasyHOI leverages large foundational models and a prior-guided optimization scheme to reconstruct diverse hand-object interactions from single images, outperforming existing methods in accuracy and robustness.

Contribution

The paper introduces a novel pipeline combining large models and prior-guided optimization for single-view hand-object interaction reconstruction, addressing challenges of ambiguity and occlusion.

Findings

01

Outperforms baseline methods across multiple datasets.

02

Faithfully reconstructs diverse hand-object interactions.

03

Utilizes large models for robust initial estimates.

Abstract

Our work aims to reconstruct hand-object interactions from a single-view image, which is a fundamental but ill-posed task. Unlike methods that reconstruct from videos, multi-view images, or predefined 3D templates, single-view reconstruction faces significant challenges due to inherent ambiguities and occlusions. These challenges are further amplified by the diverse nature of hand poses and the vast variety of object shapes and sizes. Our key insight is that current foundational models for segmentation, inpainting, and 3D reconstruction robustly generalize to in-the-wild images, which could provide strong visual and geometric priors for reconstructing hand-object interactions. Specifically, given a single image, we first design a novel pipeline to estimate the underlying hand pose and object shape using off-the-shelf large models. Furthermore, with the initial reconstruction, we employ…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lym29/EasyHOI
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Human Motion and Animation · Context-Aware Activity Recognition Systems

MethodsSparse Evolutionary Training