CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects
Nick Heppert, Muhammad Zubair Irshad, Sergey Zakharov, Katherine Liu,, Rares Andrei Ambrus, Jeannette Bohg, Abhinav Valada, Thomas Kollar

TL;DR
CARTO is a new method that reconstructs multiple articulated objects from a single stereo image using implicit representations, achieving high accuracy and real-time inference, and transferring from simulation to real-world data.
Contribution
It introduces a unified decoder for multiple object categories and demonstrates effective reconstruction and pose estimation from a single stereo image.
Findings
20.4% improvement in mAP 3D IOU50 over baseline
Real-time inference at 1 Hz for up to 8 objects
Successful transfer from simulated to real-world data
Abstract
We present CARTO, a novel approach for reconstructing multiple articulated objects from a single stereo RGB observation. We use implicit object-centric representations and learn a single geometry and articulation decoder for multiple object categories. Despite training on multiple categories, our decoder achieves a comparable reconstruction accuracy to methods that train bespoke decoders separately for each category. Combined with our stereo image encoder we infer the 3D shape, 6D pose, size, joint type, and the joint state of multiple unknown objects in a single forward pass. Our method achieves a 20.4% absolute improvement in mAP 3D IOU50 for novel instances when compared to a two-stage pipeline. Inference time is fast and can run on a NVIDIA TITAN XP GPU at 1 HZ for eight or less objects present. While only trained on simulated data, CARTO transfers to real-world object instances.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · Advanced Neural Network Applications · Image and Object Detection Techniques
