Geometrically Plausible Object Pose Refinement using Differentiable Simulation
Anil Zeybek, Rhys Newbury, Snehal Dikhale, Nawid Jamali, Soshi Iba, Akansel Cosgun

TL;DR
This paper introduces a multi-modal pose refinement method that uses differentiable physics simulation and rendering to improve the geometric plausibility and accuracy of object poses in manipulation tasks.
Contribution
It presents a novel approach combining differentiable physics, rendering, and visuo-tactile sensing for physically consistent pose refinement, outperforming standard methods.
Findings
Reduces intersection volume error by 73% with accurate initial estimates.
Achieves over 87% reduction in intersection error under high uncertainty.
Improves geometric plausibility while decreasing translation and orientation errors.
Abstract
State-of-the-art object pose estimation methods are prone to generating geometrically infeasible pose hypotheses. This problem is prevalent in dexterous manipulation, where estimated poses often intersect with the robotic hand or are not lying on a support surface. We propose a multi-modal pose refinement approach that combines differentiable physics simulation, differentiable rendering and visuo-tactile sensing to optimize object poses for both spatial accuracy and physical consistency. Simulated experiments show that our approach reduces the intersection volume error between the object and robotic hand by 73\% when the initial estimate is accurate and by over 87\% under high initial uncertainty, significantly outperforming standard ICP-based baselines. Furthermore, the improvement in geometric plausibility is accompanied by a concurrent reduction in translation and orientation errors.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Motor Control and Adaptation
