LLMER: Crafting Interactive Extended Reality Worlds with JSON Data Generated by Large Language Models
Jiangong Chen, Xiaoyi Wu, Tian Lan, Bin Li

TL;DR
This paper introduces LLMER, a framework that uses JSON data generated by large language models to create interactive XR worlds, reducing errors and processing time compared to script-based methods.
Contribution
LLMER is a novel approach that translates natural language into JSON data for XR environments, improving robustness and efficiency over prior script-focused methods.
Findings
Over 80% reduction in token consumption
Around 60% decrease in task completion time
Positive user feedback indicating system effectiveness
Abstract
The integration of Large Language Models (LLMs) like GPT-4 with Extended Reality (XR) technologies offers the potential to build truly immersive XR environments that interact with human users through natural language, e.g., generating and animating 3D scenes from audio inputs. However, the complexity of XR environments makes it difficult to accurately extract relevant contextual data and scene/object parameters from an overwhelming volume of XR artifacts. It leads to not only increased costs with pay-per-use models, but also elevated levels of generation errors. Moreover, existing approaches focusing on coding script generation are often prone to generation errors, resulting in flawed or invalid scripts, application crashes, and ultimately a degraded user experience. To overcome these challenges, we introduce LLMER, a novel framework that creates interactive XR worlds using JSON data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Educational Games and Gamification · Topic Modeling
MethodsAttention Is All You Need · Label Smoothing · Layer Normalization · Linear Layer · Byte Pair Encoding · Dense Connections · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam
