Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar

Aolan Sun; Xulong Zhang; Tiandong Ling; Jianzong Wang; Ning Cheng; Jing Xiao

arXiv:2210.06877·cs.AI·October 14, 2022

Open Access

TL;DR

Pre-Avatar is an automated system that creates presentation videos featuring a talking avatar from a single photo and voice recording, reducing production costs for communication materials.

Contribution

It introduces a novel framework combining voice cloning, speech synthesis, and avatar animation to efficiently generate presentation videos from minimal input.

Findings

01. Successfully generates realistic talking avatars from a single photo and voice

02. Reduces time and cost in creating presentation videos

03. System is available as free software for public use

Abstract

Since the beginning of the COVID-19 pandemic, remote conferencing and school teaching have become important tools. Previous applications aim to save commuting costs through real-time interaction. Our application instead aims to lower the production and reproduction costs of preparing communication materials. This paper proposes a system called Pre-Avatar, which generates a presentation video with a talking face of a target speaker from one front-face photo and a 3-minute voice recording. Technically, the system consists of three main modules: a user experience interface (UEI), a talking face module, and a few-shot text-to-speech (TTS) module. The system first clones the target speaker's voice, then generates the speech, and finally generates an avatar with appropriate lip and head movements. Under any scenario, users only need to replace slides with different notes to generate…
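
The processing order described in the abstract (clone the voice, synthesize speech from the slide notes, then animate the face) can be sketched roughly as below. This is an illustrative sketch only, not the authors' implementation: every name here (Slide, Segment, build_presentation, and the fake_* stage functions) is hypothetical, and the three stages are trivial stand-in stubs rather than real voice cloning, TTS, or talking-face models.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Slide:
    image_path: str  # rendered slide image (hypothetical field)
    notes: str       # speaker notes to be spoken by the avatar


@dataclass
class Segment:
    slide: Slide
    audio: bytes       # synthesized speech for this slide's notes
    frames: List[str]  # placeholder identifiers for avatar video frames


def build_presentation(
    slides: List[Slide],
    photo_path: str,
    voice_sample_path: str,
    clone_voice: Callable[[str], object],
    synthesize: Callable[[object, str], bytes],
    animate: Callable[[str, bytes], List[str]],
) -> List[Segment]:
    """Run the three stages in the order the abstract gives."""
    voice = clone_voice(voice_sample_path)      # 1. clone the voice once
    segments = []
    for slide in slides:
        audio = synthesize(voice, slide.notes)  # 2. speech from the notes
        frames = animate(photo_path, audio)     # 3. talking face from photo + audio
        segments.append(Segment(slide, audio, frames))
    return segments


# Stand-in stubs (assumptions): they only make the data flow visible.
def fake_clone(sample_path: str) -> dict:
    return {"speaker": sample_path}


def fake_tts(voice: dict, text: str) -> bytes:
    return text.encode("utf-8")


def fake_animate(photo_path: str, audio: bytes) -> List[str]:
    # One placeholder "frame" per audio byte, purely for illustration.
    return [f"{photo_path}#frame{i}" for i in range(len(audio))]


segments = build_presentation(
    [Slide("s1.png", "hi"), Slide("s2.png", "bye!")],
    "face.jpg",
    "voice.wav",
    fake_clone,
    fake_tts,
    fake_animate,
)
```

Because the voice is cloned once and reused, producing a new video in this sketch only requires passing a different list of slides and notes, which mirrors the reuse the abstract describes.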

Taxonomy

Topics: Virtual Reality Applications and Impacts · Video Analysis and Summarization · Multimedia Communication and Technology