From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping
Kaiyu Zheng, Andrzej Pronobis

TL;DR
This paper presents TopoNets, a probabilistic deep learning framework that models large-scale semantic maps by integrating multi-level spatial and semantic information, enabling robots to perform real-time, uncertain reasoning in complex environments.
Contribution
The paper introduces TopoNets, a novel end-to-end probabilistic deep network that models structured semantic maps with dynamic graphs for large-scale environments.
Findings
Successfully performs end-to-end semantic mapping from noisy data.
Enables uncertain reasoning about unexplored spaces.
Achieves real-time, exact inference in large environments.
Abstract
We introduce TopoNets, end-to-end probabilistic deep networks for modeling semantic maps with structure reflecting the topology of large-scale environments. TopoNets build a unified deep network spanning multiple levels of abstraction and spatial scales, from pixels representing geometry of local places to high-level descriptions of semantics of buildings. To this end, TopoNets leverage complex spatial relations expressed in terms of arbitrary, dynamic graphs. We demonstrate how TopoNets can be used to perform end-to-end semantic mapping from partial sensory observations and noisy topological relations discovered by a robot exploring large-scale office spaces. Thanks to their probabilistic nature and generative properties, TopoNets extend the problem of semantic mapping beyond classification. We show that TopoNets successfully perform uncertain reasoning about yet unexplored space and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Multimodal Machine Learning Applications
