RITA: A Real-time Interactive Talking Avatars Framework
Wuxinlin Cheng, Cheng Wan, Yupeng Cao, Sihan Chen

TL;DR
RITA is a real-time framework that turns user-uploaded photos into interactive digital avatars, combining generative models with computer vision and NLP to support live dialogue in virtual reality, online education, and interactive gaming.
Contribution
The paper presents a novel real-time interactive avatar system that transforms user photos into conversational digital personas using advanced generative models.
Findings
Transforms user-uploaded photos into digital avatars.
Supports real-time dialogue interactions with those avatars.
Potential applications in virtual reality, online education, and interactive gaming.
Abstract
RITA presents a high-quality real-time interactive framework built upon generative models, designed with practical applications in mind. Our framework enables the transformation of user-uploaded photos into digital avatars that can engage in real-time dialogue interactions. By leveraging the latest advancements in generative modeling, we have developed a versatile platform that not only enhances the user experience through dynamic conversational avatars but also opens new avenues for applications in virtual reality, online education, and interactive gaming. This work showcases the potential of integrating computer vision and natural language processing technologies to create immersive and interactive digital personas, pushing the boundaries of how we interact with digital content.
Taxonomy
Topics: Artificial Intelligence in Games · Speech and dialogue systems
