Closing the Loop: Unified 3D Scene Generation and Immersive Interaction via LLM-RL Coupling

Anh H. Vo; Sungyo Lee; Phil-Joong Kim; Soo-Mi Choi; and Yong-Guk Kim

arXiv:2605.05711·cs.CV·May 8, 2026

Closing the Loop: Unified 3D Scene Generation and Immersive Interaction via LLM-RL Coupling

Anh H. Vo, Sungyo Lee, Phil-Joong Kim, Soo-Mi Choi, and Yong-Guk Kim

PDF

1 Repo

TL;DR

This paper introduces a unified framework that integrates language-driven 3D scene generation with immersive user interaction, enhancing responsiveness and realism in multimedia systems.

Contribution

It presents a novel closed-loop system coupling large language models with reinforcement learning for adaptive 3D scene creation and interaction.

Findings

01

Achieved state-of-the-art results on the ALFRED benchmark.

02

Qualitative improvements in immersion and interaction quality.

03

User studies confirm increased task efficiency and realism.

Abstract

Recent advances in large language models (LLMs) have significantly improved language-driven 3D content generation, but most existing approaches still treat scene generation and user interaction as separate processes, limiting the adaptability and immersive potential of interactive multimedia systems. This paper presents a unified framework that closes the loop between language-driven 3D scene generation and immersive user interaction. Given natural language instructions, the system first constructs structured scene representations using LLMs, and then optimizes spatial layouts via reinforcement learning under geometric and semantic constraints. The generated environments are deployed in a virtual reality setting to facilitate HRI-in-the-loop, where user interactions provide continuous feedback to align generated content with human perception and usability. By tightly coupling generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://proj-showcase.github.io/h3ds
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.