Video prediction using score-based conditional density estimation
Pierre-\'Etienne H. Fiquet, Eero P. Simoncelli

TL;DR
This paper introduces a score-based implicit density estimation framework for video prediction that effectively models uncertainty and handles occlusions, outperforming classical methods in selecting probable future frames.
Contribution
It presents a novel deep learning approach for conditional density estimation in video prediction, capable of managing high-dimensional data and ambiguous future states.
Findings
Handles occlusion boundaries better than classical methods
Automatically weights predictive evidence by reliability
Extracts adaptive representations for temporal prediction
Abstract
Temporal prediction is inherently uncertain, but representing the ambiguity in natural image sequences is a challenging high-dimensional probabilistic inference problem. For natural scenes, the curse of dimensionality renders explicit density estimation statistically and computationally intractable. Here, we describe an implicit regression-based framework for learning and sampling the conditional density of the next frame in a video given previous observed frames. We show that sequence-to-image deep networks trained on a simple resilience-to-noise objective function extract adaptive representations for temporal prediction. Synthetic experiments demonstrate that this score-based framework can handle occlusion boundaries: unlike classical methods that average over bifurcating temporal trajectories, it chooses among likely trajectories, selecting more probable options with higher…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
