Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar
Aolan Sun, Xulong Zhang, Tiandong Ling, Jianzong Wang, Ning Cheng, Jing Xiao

TL;DR
Pre-Avatar is an automated system that creates presentation videos featuring a talking avatar from a single photo and voice recording, reducing production costs for communication materials.
Contribution
It introduces a novel framework combining voice cloning, speech synthesis, and avatar animation to efficiently generate presentation videos from minimal input.
Findings
Successfully generates realistic talking avatars from a single photo and voice
Reduces time and cost in creating presentation videos
System is available as free software for public use
Abstract
Since the beginning of the COVID-19 pandemic, remote conferencing and school-teaching have become important tools. Previous applications aim to save commuting costs through real-time interaction. Our application instead aims to lower the production and reproduction costs of preparing communication materials. This paper proposes a system called Pre-Avatar, which generates a presentation video with a talking face of a target speaker from one front-face photo and a 3-minute voice recording. Technically, the system consists of three main modules: a user experience interface (UEI), a talking face module, and a few-shot text-to-speech (TTS) module. The system first clones the target speaker's voice, then generates the speech, and finally generates an avatar with appropriate lip and head movements. Under any scenario, users only need to replace slides with different notes to generate…
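The three-stage flow described in the abstract (clone the voice once, synthesize speech per slide, then animate the avatar) can be sketched roughly as follows. This is an illustrative outline only, not the authors' implementation: every class and function name here (`Slide`, `clone_voice`, `synthesize_speech`, `animate_avatar`, `build_presentation`) is hypothetical, and the model stages are placeholder stubs that only pass data along.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Slide:
    image_path: str  # rendered slide image
    notes: str       # speaker notes to be read aloud


@dataclass
class VoiceProfile:
    speaker_id: str  # stands in for a learned speaker representation


@dataclass
class Speech:
    text: str
    speaker_id: str


@dataclass
class AvatarClip:
    slide_image: str
    photo_path: str
    speech: Speech


def clone_voice(recording_path: str) -> VoiceProfile:
    # Placeholder (assumption): a real few-shot TTS module would adapt
    # to the speaker from the short voice recording.
    return VoiceProfile(speaker_id=recording_path)


def synthesize_speech(voice: VoiceProfile, text: str) -> Speech:
    # Placeholder (assumption): a real module would return audio.
    return Speech(text=text, speaker_id=voice.speaker_id)


def animate_avatar(photo_path: str, speech: Speech, slide_image: str) -> AvatarClip:
    # Placeholder (assumption): a real talking face module would render
    # lip and head movements driven by the synthesized speech.
    return AvatarClip(slide_image=slide_image, photo_path=photo_path, speech=speech)


def build_presentation(photo_path: str, recording_path: str,
                       slides: List[Slide]) -> List[AvatarClip]:
    voice = clone_voice(recording_path)  # done once per speaker
    clips = []
    for slide in slides:
        speech = synthesize_speech(voice, slide.notes)
        clips.append(animate_avatar(photo_path, speech, slide.image_path))
    return clips
```

In this sketch, changing the deck only means passing a different list of `Slide` objects; the photo and voice recording are reused unchanged, which mirrors the abstract's point that users only replace slides and notes.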
Taxonomy
Topics: Virtual Reality Applications and Impacts · Video Analysis and Summarization · Multimedia Communication and Technology
