Constructing Category-Specific Models for Monocular Object-SLAM
Parv Parkhiya, Rishabh Khawad, J. Krishna Murthy, Brojeshwar Bhowmick,, K. Madhava Krishna

TL;DR
This paper introduces a real-time monocular object-SLAM system that constructs category-level models from CAD data, enabling object shape estimation and improved robustness over traditional feature-based SLAM.
Contribution
It develops a novel rendering pipeline for data synthesis, learns category models for 3D deformations and 2D features, and integrates these into a monocular SLAM framework for improved object and scene understanding.
Findings
First instance-independent monocular object-SLAM system.
Enhanced robustness in feature-sparse scenarios.
Successful evaluation on real-world challenging scenes.
Abstract
We present a new paradigm for real-time object-oriented SLAM with a monocular camera. Contrary to previous approaches, that rely on object-level models, we construct category-level models from CAD collections which are now widely available. To alleviate the need for huge amounts of labeled data, we develop a rendering pipeline that enables synthesis of large datasets from a limited amount of manually labeled data. Using data thus synthesized, we learn category-level models for object deformations in 3D, as well as discriminative object features in 2D. These category models are instance-independent and aid in the design of object landmark observations that can be incorporated into a generic monocular SLAM framework. Where typical object-SLAM approaches usually solve only for object and camera poses, we also estimate object shape on-the-fly, allowing for a wide range of objects from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
