Label-Conditioned Next-Frame Video Generation with Neural Flows
David Donahue

TL;DR
This paper introduces a neural flow-based model called Glow for label-conditioned next-frame video generation, offering more stability and better evaluation metrics than traditional GANs or VAEs.
Contribution
The work demonstrates how to adapt Glow, a neural flow model, for conditional video generation and evaluates its performance using cross entropy, addressing stability and evaluation issues.
Findings
Glow provides stable training compared to GANs.
Conditional Glow can generate videos based on textual labels.
Cross entropy effectively evaluates model performance.
Abstract
Recent state-of-the-art video generation systems employ Generative Adversarial Networks (GANs) or Variational Autoencoders (VAEs) to produce novel videos. However, VAE models typically produce blurry outputs when faced with sub-optimal conditioning of the input, and GANs are known to be unstable for large output sizes. In addition, the output videos of these models are difficult to evaluate, partly because the GAN loss function is not an accurate measure of convergence. In this work, we propose using a state-of-the-art neural flow generator called Glow to generate videos conditioned on a textual label, one frame at a time. Neural flow models are more stable than standard GANs, as they only optimize a single cross entropy loss function, which is monotonic and avoids the circular convergence issues of the GAN minimax objective. In addition, we also show how to condition Glow on external…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Advanced Vision and Imaging
MethodsInvertible 1x1 Convolution · Activation Normalization · Affine Coupling · Normalizing Flows · 1x1 Convolution · GLOW · Convolution · USD Coin Customer Service Number +1-833-534-1729 · Dogecoin Customer Service Number +1-833-534-1729
