LLM-Driven Augmented Reality Puppeteer: Controller-Free Voice-Commanded Robot Teleoperation
Yuchong Zhang, Bastian Orthmann, Michael C. Welle, Jonne Van, Haastregt, Danica Kragic

TL;DR
This paper presents a novel controller-free AR system that uses large language models and voice commands to enable intuitive, safe, and immersive robot teleoperation without physical controllers.
Contribution
It introduces a new AR puppeteering system driven by LLMs and NLP, eliminating the need for physical controllers in robot teleoperation.
Findings
System successfully enables controller-free robot control
User demonstration validates system functionality
Potential for safer, more intuitive robotic interaction
Abstract
The integration of robotics and augmented reality (AR) presents transformative opportunities for advancing human-robot interaction (HRI) by improving usability, intuitiveness, and accessibility. This work introduces a controller-free, LLM-driven voice-commanded AR puppeteering system, enabling users to teleoperate a robot by manipulating its virtual counterpart in real time. By leveraging natural language processing (NLP) and AR technologies, our system -- prototyped using Meta Quest 3 -- eliminates the need for physical controllers, enhancing ease of use while minimizing potential safety risks associated with direct robot operation. A preliminary user demonstration successfully validated the system's functionality, demonstrating its potential for safer, more intuitive, and immersive robotic control.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTeleoperation and Haptic Systems
