Grasp-and-Lift: Executable 3D Hand-Object Interaction Reconstruction via Physics-in-the-Loop Optimization

Byeonggyeol Choi; Woojin Oh; Jongwoo Lim

arXiv:2601.18121·cs.RO·January 27, 2026

Grasp-and-Lift: Executable 3D Hand-Object Interaction Reconstruction via Physics-in-the-Loop Optimization

Byeonggyeol Choi, Woojin Oh, Jongwoo Lim

PDF

Open Access

TL;DR

This paper introduces a physics-in-the-loop optimization framework that refines 3D hand-object interaction trajectories from visual data into physically plausible motions, improving the realism and utility of manipulation datasets.

Contribution

It formulates a black-box optimization approach using CMA-ES and spline-based hand motion parameterization to generate physically valid hand-object interactions from visual trajectories.

Findings

01

Achieves lower pose errors compared to previous methods.

02

Recovers more accurate hand-object physical interactions.

03

Enables scalable generation of high-fidelity manipulation data.

Abstract

Dexterous hand manipulation increasingly relies on large-scale motion datasets with precise hand-object trajectory data. However, existing resources such as DexYCB and HO3D are primarily optimized for visual alignment but often yield physically implausible interactions when replayed in physics simulators, including penetration, missed contact, and unstable grasps. We propose a simulation-in-the-loop refinement framework that converts these visually aligned trajectories into physically executable ones. Our core contribution is to formulate this as a tractable black-box optimization problem. We parameterize the hand's motion using a low-dimensional, spline-based representation built on sparse temporal keyframes. This allows us to use a powerful gradient-free optimizer, CMA-ES, to treat the high-fidelity physics engine as a black-box objective function. Our method finds motions that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Robot Manipulation and Learning · Human Pose and Action Recognition