RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation
Sourav Garg, Krishan Rana, Mehdi Hosseinzadeh, Lachlan Mares, Niko, S\"underhauf, Feras Dayoub, Ian Reid

TL;DR
RoboHop introduces a novel segment-based topological map representation for open-world visual navigation, enabling semantic reasoning, object search, and improved localization through a graph of image segments.
Contribution
The paper proposes a new topological environment representation using semantically meaningful image segments, enhancing navigation and localization over previous pixel-based or 3D scene graph methods.
Findings
Effective segment-level localization using graph convolution
Navigation plans based on segment hops demonstrated in real-world data
Successful zero-shot navigation with segment hopping
Abstract
Mapping is crucial for spatial reasoning, planning and robot navigation. Existing approaches range from metric, which require precise geometry-based optimization, to purely topological, where image-as-node based graphs lack explicit object-level reasoning and interconnectivity. In this paper, we propose a novel topological representation of an environment based on "image segments", which are semantically meaningful and open-vocabulary queryable, conferring several advantages over previous works based on pixel-level features. Unlike 3D scene graphs, we create a purely topological graph with segments as nodes, where edges are formed by a) associating segment-level descriptors between pairs of consecutive images and b) connecting neighboring segments within an image using their pixel centroids. This unveils a "continuous sense of a place", defined by inter-image persistence of segments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Geographic Information Systems Studies · Robotics and Sensor-Based Localization
MethodsConvolution
