Deep Sequence Learning for Video Anticipation: From Discrete and Deterministic to Continuous and Stochastic
Sadegh Aliakbarian

TL;DR
This paper explores advanced deep learning models for video anticipation, transitioning from predicting coarse, deterministic futures to detailed, stochastic ones, addressing the inherent ambiguity in future video representations.
Contribution
It introduces novel methods for predicting both deterministic and stochastic future representations, advancing the capability of video anticipation models.
Findings
Effective prediction of coarse deterministic futures
Successful modeling of diverse stochastic continuations
Improved accuracy over previous methods
Abstract
Video anticipation is the task of predicting one/multiple future representation(s) given limited, partial observation. This is a challenging task due to the fact that given limited observation, the future representation can be highly ambiguous. Based on the nature of the task, video anticipation can be considered from two viewpoints: the level of details and the level of determinism in the predicted future. In this research, we start from anticipating a coarse representation of a deterministic future and then move towards predicting continuous and fine-grained future representations of a stochastic process. The example of the former is video action anticipation in which we are interested in predicting one action label given a partially observed video and the example of the latter is forecasting multiple diverse continuations of human motion given partially observed one. In particular,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Anomaly Detection Techniques and Applications
