AgentSGEN: Multi-Agent LLM in the Loop for Semantic Collaboration and GENeration of Synthetic Data
Vu Dinh Xuan, Hao Vo, David Murphy, Hoang D. Nguyen

TL;DR
AgentSGEN introduces a multi-agent LLM-based framework that iteratively collaborates to generate semantically rich synthetic safety-critical scenes, addressing data scarcity in safety-critical AI applications.
Contribution
The paper presents a novel multi-agent system employing LLMs for iterative, semantic-aware synthetic data generation tailored to safety-critical scenarios.
Findings
Effective generation of safety-critical scenes with semantic depth
Balances safety constraints with visual realism
Outperforms prior synthetic data methods in semantic consistency
Abstract
The scarcity of data depicting dangerous situations presents a major obstacle to training AI systems for safety-critical applications, such as construction safety, where ethical and logistical barriers hinder real-world data collection. This creates an urgent need for an end-to-end framework to generate synthetic data that can bridge this gap. While existing methods can produce synthetic scenes, they often lack the semantic depth required for scene simulations, limiting their effectiveness. To address this, we propose a novel multi-agent framework that employs an iterative, in-the-loop collaboration between two agents: an Evaluator Agent, acting as an LLM-based judge to enforce semantic consistency and safety-specific constraints, and an Editor Agent, which generates and refines scenes based on this guidance. Powered by LLM's capabilities to reasoning and common-sense knowledge, this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies
