ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization
Muhammad Zubair Irshad, Sergey Zakharov, Rares Ambrus, Thomas Kollar,, Zsolt Kira, Adrien Gaidon

TL;DR
ShAPO is a novel method for joint multi-object detection, 3D textured reconstruction, and pose estimation from a single RGB-D image, leveraging implicit representations and a differentiable optimization process.
Contribution
It introduces a single-shot pipeline with a disentangled shape and appearance database, and an octree-based optimization for improved accuracy in multi-object 3D understanding.
Findings
Outperforms baselines with 8% higher mAP on NOCS dataset
Accurately reconstructs unseen objects without prior 3D meshes
Effective in real-world scenarios with minimal fine-tuning
Abstract
Our method studies the complex task of object-centric 3D understanding from a single RGB-D observation. As it is an ill-posed problem, existing methods suffer from low performance for both 3D shape and 6D pose and size estimation in complex multi-object scenarios with occlusions. We present ShAPO, a method for joint multi-object detection, 3D textured reconstruction, 6D object pose and size estimation. Key to ShAPO is a single-shot pipeline to regress shape, appearance and pose latent codes along with the masks of each object instance, which is then further refined in a sparse-to-dense fashion. A novel disentangled shape and appearance database of priors is first learned to embed objects in their respective shape and appearance space. We also propose a novel, octree-based differentiable optimization step, allowing us to further improve object shape, pose and appearance simultaneously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
