Benefits of Feature Extraction and Temporal Sequence Analysis for Video Frame Prediction: An Evaluation of Hybrid Deep Learning Models
Jose M. S\'anchez Vel\'azquez, Mingbo Cai, Andrew Coney, \'Alvaro J. Garc\'ia- Tejedor, Alberto Nogales

TL;DR
This paper evaluates hybrid deep learning models combining feature extraction and temporal analysis for video frame prediction, demonstrating improved SSIM metrics and highlighting the effectiveness of 3DCNNs and ConvLSTMs, especially on greyscale real-world videos.
Contribution
It introduces and assesses hybrid deep learning architectures that integrate autoencoders with RNNs and 3D CNNs for enhanced video frame prediction.
Findings
Hybrid models with 3DCNNs and ConvLSTMs outperform others.
Greyscale real-world videos are easier to predict.
SSIM improved from 0.69 to 0.82.
Abstract
In recent years, advances in Artificial Intelligence have significantly impacted computer science, particularly in the field of computer vision, enabling solutions to complex problems such as video frame prediction. Video frame prediction has critical applications in weather forecasting or autonomous systems and can provide technical improvements, such as video compression and streaming. Among Artificial Intelligence methods, Deep Learning has emerged as highly effective for solving vision-related tasks, although current frame prediction models still have room for enhancement. This paper evaluates several hybrid deep learning approaches that combine the feature extraction capabilities of autoencoders with temporal sequence modelling using Recurrent Neural Networks (RNNs), 3D Convolutional Neural Networks (3D CNNs), and related architectures. The proposed solutions were rigorously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Advanced Image Processing Techniques · Image Enhancement Techniques
