Virbo: Multimodal Multilingual Avatar Video Generation in Digital   Marketing

Juan Zhang; Jiahao Chen; Cheng Wang; Zhiwang Yu; Tangquan Qi; Can Liu,; Di Wu

arXiv:2403.11700·cs.MM·March 25, 2024·1 cites

Virbo: Multimodal Multilingual Avatar Video Generation in Digital Marketing

Juan Zhang, Jiahao Chen, Cheng Wang, Zhiwang Yu, Tangquan Qi, Can Liu,, Di Wu

PDF

Open Access

TL;DR

Virbo is an AI-powered system that automatically generates high-quality, multilingual talking avatar videos from scripts, significantly reducing production costs and time in digital marketing.

Contribution

The paper introduces Virbo, a novel multimodal multilingual avatar video generation system that automates and simplifies short video production for marketing.

Findings

01

Virbo produces videos comparable in quality to professional productions.

02

The system significantly reduces production costs and time.

03

It effectively supports multilingual and customizable avatar video generation.

Abstract

With the widespread popularity of internet celebrity marketing all over the world, short video production has gradually become a popular way of presenting products information. However, the traditional video production industry usually includes series of procedures as script writing, video filming in a professional studio, video clipping, special effects rendering, customized post-processing, and so forth. Not to mention that multilingual videos is not accessible for those who could not speak multilingual languages. These complicated procedures usually needs a professional team to complete, and this made short video production costly in both time and money. This paper presents an intelligent system that supports the automatic generation of talking avatar videos, namely Virbo. With simply a user-specified script, Virbo could use a deep generative model to generate a target talking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage, Metaphor, and Cognition