RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling
Jian Liao, Adnan Karim, Shivesh Jadon, Rubaiat Habib Kazi, Ryo Suzuki

TL;DR
RealityTalk is a system that enables real-time, speech-driven augmented presentations with interactive virtual elements, enhancing live storytelling without requiring extensive editing skills.
Contribution
It introduces novel interaction techniques for live augmented presentations and integrates them into a real-time system for speech-driven virtual element manipulation.
Findings
Effective real-time speech-driven interactions demonstrated
Enhanced engagement in live presentations achieved
System evaluated positively by presenters
Abstract
We present RealityTalk, a system that augments real-time live presentations with speech-driven interactive virtual elements. Augmented presentations leverage embedded visuals and animation for engaging and expressive storytelling. However, existing tools for live presentations often lack interactivity and improvisation, while creating such effects in video editing tools require significant time and expertise. RealityTalk enables users to create live augmented presentations with real-time speech-driven interactions. The user can interactively prompt, move, and manipulate graphical elements through real-time speech and supporting modalities. Based on our analysis of 177 existing video-edited augmented presentations, we propose a novel set of interaction techniques and then incorporated them into RealityTalk. We evaluate our tool from a presenter's perspective to demonstrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
