Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning

Albert Gassol Puigjaner; Angelos Zacharia; Kostas Alexis

arXiv:2602.02456·cs.RO·February 3, 2026

Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning

Albert Gassol Puigjaner, Angelos Zacharia, Kostas Alexis

PDF

Open Access

TL;DR

This paper presents a hierarchical 3D scene graph framework that integrates open-vocabulary features and relational reasoning, enabling autonomous agents to better understand and interact with complex environments.

Contribution

It introduces an enhanced scene graph that combines VLM and LLM for semantic and relational reasoning across multiple abstraction levels.

Findings

01

Effective scene understanding in diverse environments

02

Improved task reasoning capabilities for robots

03

Successful deployment on quadruped robot

Abstract

Representing and understanding 3D environments in a structured manner is crucial for autonomous agents to navigate and reason about their surroundings. While traditional Simultaneous Localization and Mapping (SLAM) methods generate metric reconstructions and can be extended to metric-semantic mapping, they lack a higher level of abstraction and relational reasoning. To address this gap, 3D scene graphs have emerged as a powerful representation for capturing hierarchical structures and object relationships. In this work, we propose an enhanced hierarchical 3D scene graph that integrates open-vocabulary features across multiple abstraction levels and supports object-relational reasoning. Our approach leverages a Vision Language Model (VLM) to infer semantic relationships. Notably, we introduce a task reasoning module that combines Large Language Models (LLM) and a VLM to interpret the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Robotics and Sensor-Based Localization · Constraint Satisfaction and Optimization