MA3DSG: Multi-Agent 3D Scene Graph Generation for Large-Scale Indoor Environments

Yirum Kim; Jaewoo Kim; Ue-Hwan Kim

arXiv:2602.04152·cs.RO·February 5, 2026

MA3DSG: Multi-Agent 3D Scene Graph Generation for Large-Scale Indoor Environments

Yirum Kim, Jaewoo Kim, Ue-Hwan Kim

PDF

Open Access

TL;DR

This paper introduces MA3DSG, a multi-agent framework for large-scale 3D scene graph generation that overcomes scalability limitations of single-agent methods, using a training-free graph alignment and a new benchmark.

Contribution

We propose the first multi-agent 3D scene graph generation framework with a training-free graph alignment algorithm and a comprehensive benchmark for diverse environments.

Findings

01

Enables scalable 3DSGG with multiple agents

02

Achieves effective graph merging without learnable parameters

03

Provides a versatile benchmark for evaluation

Abstract

Current 3D scene graph generation (3DSGG) approaches heavily rely on a single-agent assumption and small-scale environments, exhibiting limited scalability to real-world scenarios. In this work, we introduce Multi-Agent 3D Scene Graph Generation (MA3DSG) model, the first framework designed to tackle this scalability challenge using multiple agents. We develop a training-free graph alignment algorithm that efficiently merges partial query graphs from individual agents into a unified global scene graph. Leveraging extensive analysis and empirical insights, our approach enables conventional single-agent systems to operate collaboratively without requiring any learnable parameters. To rigorously evaluate 3DSGG performance, we propose MA3DSG-Bench-a benchmark that supports diverse agent configurations, domain sizes, and environmental conditions-providing a more general and extensible…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · 3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization