LLM-Driven 3D Scene Generation of Agricultural Simulation Environments
Arafa Yoncalik, Wouter Jansen, Nico Huebel, Mohammad Hasan Rahmani, Jan Steckel

TL;DR
This paper presents a modular multi-LLM pipeline for generating realistic agricultural 3D environments from natural language prompts, improving control, scalability, and accuracy over previous monolithic approaches.
Contribution
It introduces a novel modular architecture integrating multiple LLM techniques for domain-specific 3D scene generation in Unreal Engine, with validation and user studies demonstrating effectiveness.
Findings
Achieved realistic planting layouts and environmental context based on prompts.
Significant time savings compared to manual scene creation.
Enhanced reliability and precision through hybrid LLM strategies.
Abstract
Procedural generation techniques in 3D rendering engines have revolutionized the creation of complex environments, reducing reliance on manual design. Recent approaches using Large Language Models (LLMs) for 3D scene generation show promise but often lack domain-specific reasoning, verification mechanisms, and modular design. These limitations lead to reduced control and poor scalability. This paper investigates the use of LLMs to generate agricultural synthetic simulation environments from natural language prompts, specifically to address the limitations of lacking domain-specific reasoning, verification mechanisms, and modular design. A modular multi-LLM pipeline was developed, integrating 3D asset retrieval, domain knowledge injection, and code generation for the Unreal rendering engine using its API. This results in a 3D environment with realistic planting layouts and environmental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Human Motion and Animation · Multimodal Machine Learning Applications
