RITA: A Real-time Interactive Talking Avatars Framework

Wuxinlin Cheng; Cheng Wan; Yupeng Cao; Sihan Chen

arXiv:2406.13093·cs.CV·June 21, 2024

RITA: A Real-time Interactive Talking Avatars Framework

Wuxinlin Cheng, Cheng Wan, Yupeng Cao, Sihan Chen

PDF

Open Access 1 Repo

TL;DR

RITA introduces a real-time framework for creating interactive digital avatars from photos, combining generative models with computer vision and NLP to enable engaging conversations in various virtual applications.

Contribution

The paper presents a novel real-time interactive avatar system that transforms user photos into conversational digital personas using advanced generative models.

Findings

01

Enables real-time photo-to-avatar transformation.

02

Supports dynamic conversational interactions.

03

Potential applications in VR, education, and gaming.

Abstract

RITA presents a high-quality real-time interactive framework built upon generative models, designed with practical applications in mind. Our framework enables the transformation of user-uploaded photos into digital avatars that can engage in real-time dialogue interactions. By leveraging the latest advancements in generative modeling, we have developed a versatile platform that not only enhances the user experience through dynamic conversational avatars but also opens new avenues for applications in virtual reality, online education, and interactive gaming. This work showcases the potential of integrating computer vision and natural language processing technologies to create immersive and interactive digital personas, pushing the boundaries of how we interact with digital content.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MindSpore-scientific/code-14/tree/main/RITA
mindspore

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Speech and dialogue systems