TL;DR
This paper introduces a novel memory alignment approach with long-term motion context memory and query decomposition to improve long-term video prediction accuracy, especially for complex and limited-dynamics sequences.
Contribution
It proposes a memory alignment learning framework with a long-term motion context memory and query decomposition, enhancing long-term video prediction performance.
Findings
Outperforms existing RNN-based methods in long-term prediction tasks.
Memory alignment learning effectively recalls long-term motion context.
Memory query decomposition improves local motion context matching.
Abstract
Our work addresses long-term motion context issues for predicting future frames. To predict the future precisely, it is required to capture which long-term motion context (e.g., walking or running) the input motion (e.g., leg movement) belongs to. The bottlenecks arising when dealing with the long-term motion context are: (i) how to predict the long-term motion context naturally matching input sequences with limited dynamics, (ii) how to predict the long-term motion context with high-dimensionality (e.g., complex motion). To address the issues, we propose novel motion context-aware video prediction. To solve the bottleneck (i), we introduce a long-term motion context memory (LMC-Memory) with memory alignment learning. The proposed memory alignment learning enables to store long-term motion contexts into the memory and to match them with sequences including limited dynamics. As a result,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
