Part-level Scene Reconstruction Affords Robot Interaction
Zeyu Zhang, Lexing Zhang, Zaijin Wang, Ziyuan Jiao, Muzhi Han, Yixin, Zhu, Song-Chun Zhu, Hangxin Liu

TL;DR
This paper presents a part-level scene reconstruction method that accurately replicates observed scenes using primitive shapes, enabling precise robot interaction simulation and improved scene understanding.
Contribution
The work introduces a novel part-level reconstruction approach that segments objects into semantic parts, aligns primitives, and estimates kinematic relations for more accurate scene modeling.
Findings
Outperforms object-level reconstruction in capturing scene details
Provides precise kinematic information for robotic applications
Enables validation of manipulation planning in reconstructed scenes
Abstract
Existing methods for reconstructing interactive scenes primarily focus on replacing reconstructed objects with CAD models retrieved from a limited database, resulting in significant discrepancies between the reconstructed and observed scenes. To address this issue, our work introduces a part-level reconstruction approach that reassembles objects using primitive shapes. This enables us to precisely replicate the observed physical scenes and simulate robot interactions with both rigid and articulated objects. By segmenting reconstructed objects into semantic parts and aligning primitive shapes to these parts, we assemble them as CAD models while estimating kinematic relations, including parent-child contact relations, joint types, and parameters. Specifically, we derive the optimal primitive alignment by solving a series of optimization problems, and estimate kinematic relations based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Robotics and Sensor-Based Localization
