BirdSLAM: Monocular Multibody SLAM in Bird's-Eye View
Swapnil Daga, Gokul B. Nair, Anirudha Ramesh, Rahul Sajnani, Junaid, Ahmed Ansari, K. Madhava Krishna

TL;DR
BirdSLAM introduces a monocular SLAM system that uses bird's-eye view for improved localization and mapping in autonomous driving, effectively handling scale ambiguity and dynamic objects with minimal assumptions.
Contribution
It presents a novel bird's-eye view based SLAM approach that addresses key challenges of monocular systems with minimal prior information.
Findings
Outperforms prior monocular SLAM methods in accuracy.
Effectively localizes dynamic traffic participants.
Ablation analysis highlights key design choices.
Abstract
In this paper, we present BirdSLAM, a novel simultaneous localization and mapping (SLAM) system for the challenging scenario of autonomous driving platforms equipped with only a monocular camera. BirdSLAM tackles challenges faced by other monocular SLAM systems (such as scale ambiguity in monocular reconstruction, dynamic object localization, and uncertainty in feature representation) by using an orthographic (bird's-eye) view as the configuration space in which localization and mapping are performed. By assuming only the height of the ego-camera above the ground, BirdSLAM leverages single-view metrology cues to accurately localize the ego-vehicle and all other traffic participants in bird's-eye view. We demonstrate that our system outperforms prior work that uses strictly greater information, and highlight the relevance of each design decision via an ablation analysis.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Indoor and Outdoor Localization Technologies · Tactile and Sensory Interactions
