When Generative AI Meets Extended Reality: Enabling Scalable and Natural Interactions
Mingyu Zhu, Jiangong Chen, Bin Li

TL;DR
This paper explores integrating Generative AI with Extended Reality to enable scalable, natural interactions by automating content creation and interpreting intuitive language commands, thus reducing barriers to widespread XR adoption.
Contribution
The paper introduces methods for combining GenAI with XR and demonstrates practical use cases that improve scalability and natural interaction in immersive environments.
Findings
GenAI can interpret ambiguous instructions in XR environments.
Automated 3D content generation lowers content creation barriers.
Integration improves user experience and interaction naturalness.
Abstract
Extended Reality (XR), including virtual, augmented, and mixed reality, provides immersive and interactive experiences across diverse applications, from VR-based education to AR-based assistance and MR-based training. However, widespread XR adoption remains limited due to two key challenges: 1) the high cost and complexity of authoring 3D content, especially for large-scale environments or complex interactions; and 2) the steep learning curve associated with non-intuitive interaction methods like handheld controllers or scripted gestures. Generative AI (GenAI) presents a promising solution by enabling intuitive, language-driven interaction and automating content generation. Leveraging vision-language models and diffusion-based generation, GenAI can interpret ambiguous instructions, understand physical scenes, and generate or manipulate 3D content, significantly lowering barriers to XR…
Taxonomy
Topics: Augmented Reality Applications · Interactive and Immersive Displays · Human Motion and Animation
