GraphMapper: Efficient Visual Navigation by Scene Graph Generation
Zachary Seymour, Niluthpol Chowdhury Mithun, Han-Pang Chiu, Supun, Samarasekera, Rakesh Kumar

TL;DR
GraphMapper is a method that enables autonomous agents to efficiently learn and utilize 3D scene graph representations for navigation and other tasks, improving interaction efficiency and scene understanding.
Contribution
We introduce GraphMapper, a novel approach that simultaneously learns scene graph representations and navigation policies, enhancing efficiency and versatility in autonomous navigation.
Findings
Fewer interactions needed for effective navigation.
Scene graph representations improve downstream task performance.
GraphMapper can be integrated with existing systems for better efficiency.
Abstract
Understanding the geometric relationships between objects in a scene is a core capability in enabling both humans and autonomous agents to navigate in new environments. A sparse, unified representation of the scene topology will allow agents to act efficiently to move through their environment, communicate the environment state with others, and utilize the representation for diverse downstream tasks. To this end, we propose a method to train an autonomous agent to learn to accumulate a 3D scene graph representation of its environment by simultaneously learning to navigate through said environment. We demonstrate that our approach, GraphMapper, enables the learning of effective navigation policies through fewer interactions with the environment than vision-based systems alone. Further, we show that GraphMapper can act as a modular scene encoder to operate alongside existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Advanced Graph Neural Networks
