Pixels-to-Graph: Real-time Integration of Building Information Models and Scene Graphs for Semantic-Geometric Human-Robot Understanding
Antonello Longo, Chanyoung Chung, Matteo Palieri, Sung-Kyun Kim, Ali Agha, Cataldo Guaragnella, Shehryar Khattak

TL;DR
This paper presents Pixels-to-Graph, a lightweight real-time method for generating structured scene graphs from images and LiDAR data, enhancing human-robot understanding and cooperation in unknown environments on resource-limited robots.
Contribution
The paper introduces Pixels-to-Graph, a novel CPU-only framework that creates structured scene graphs from images and LiDAR data for real-time autonomous exploration.
Findings
Effective real-time scene graph generation demonstrated on NASA JPL robot
Produces de-noised top-down maps and structured 3D point clouds
Enables improved human-robot environment understanding
Abstract
Autonomous robots are increasingly playing key roles as support platforms for human operators in high-risk, dangerous applications. To accomplish challenging tasks, an efficient human-robot cooperation and understanding is required. While typically robotic planning leverages 3D geometric information, human operators are accustomed to a high-level compact representation of the environment, like top-down 2D maps representing the Building Information Model (BIM). 3D scene graphs have emerged as a powerful tool to bridge the gap between human readable 2D BIM and the robot 3D maps. In this work, we introduce Pixels-to-Graph (Pix2G), a novel lightweight method to generate structured scene graphs from image pixels and LiDAR maps in real-time for the autonomous exploration of unknown environments on resource-constrained robot platforms. To satisfy onboard compute constraints, the framework is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Modeling in Geospatial Applications · 3D Surveying and Cultural Heritage
