Hydra: A Real-time Spatial Perception System for 3D Scene Graph Construction and Optimization
Nathan Hughes, Yun Chang, Luca Carlone

TL;DR
Hydra is a real-time system that incrementally builds and optimizes 3D scene graphs from sensor data, enabling robots to create detailed environmental models on-the-fly for improved spatial understanding.
Contribution
The paper introduces novel real-time algorithms for constructing and optimizing 3D scene graphs, including loop closure detection and hierarchical descriptors, in a robotic perception context.
Findings
Hydra achieves online 3D scene graph reconstruction with accuracy comparable to offline methods.
Hierarchical descriptors improve loop closure detection in 3D scene graphs.
The system effectively integrates perception layers for real-time environment understanding.
Abstract
3D scene graphs have recently emerged as a powerful high-level representation of 3D environments. A 3D scene graph describes the environment as a layered graph where nodes represent spatial concepts at multiple levels of abstraction and edges represent relations between concepts. While 3D scene graphs can serve as an advanced "mental model" for robots, how to build such a rich representation in real-time is still uncharted territory. This paper describes a real-time Spatial Perception System, a suite of algorithms to build a 3D scene graph from sensor data in real-time. Our first contribution is to develop real-time algorithms to incrementally construct the layers of a scene graph as the robot explores the environment; these algorithms build a local Euclidean Signed Distance Function (ESDF) around the current robot location, extract a topological map of places from the ESDF, and then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
