TiVGAN: Text to Image to Video Generation with Step-by-Step Evolutionary   Generator

Doyeon Kim; Donggyu Joo; Junmo Kim

arXiv:2009.02018·cs.CV·June 29, 2021

TiVGAN: Text to Image to Video Generation with Step-by-Step Evolutionary Generator

Doyeon Kim, Donggyu Joo, Junmo Kim

PDF

Open Access

TL;DR

TiVGAN introduces a step-by-step evolutionary GAN framework that generates high-resolution videos from text descriptions by progressively learning to produce frames, improving stability and quality.

Contribution

The paper presents a novel training framework for text-to-video generation that evolves frames sequentially, enhancing stability and resolution compared to existing methods.

Findings

01

Effective generation of high-resolution videos from text descriptions.

02

Stable training process through step-by-step frame evolution.

03

Quantitative and qualitative results demonstrate superior performance.

Abstract

Advances in technology have led to the development of methods that can create desired visual multimedia. In particular, image generation using deep learning has been extensively studied across diverse fields. In comparison, video generation, especially on conditional inputs, remains a challenging and less explored area. To narrow this gap, we aim to train our model to produce a video corresponding to a given text description. We propose a novel training framework, Text-to-Image-to-Video Generative Adversarial Network (TiVGAN), which evolves frame-by-frame and finally produces a full-length video. In the first phase, we focus on creating a high-quality single video frame while learning the relationship between the text and an image. As the steps proceed, our model is trained gradually on more number of consecutive frames.This step-by-step learning process helps stabilize the training and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Advanced Image Processing Techniques