A Perceptual Quality Metric for Video Frame Interpolation
Qiqi Hou, Abhijay Ghildyal, Feng Liu

TL;DR
This paper introduces a new perceptual quality metric specifically designed for evaluating video frame interpolation results, leveraging spatio-temporal features and trained on a dedicated dataset, outperforming existing metrics.
Contribution
The paper proposes a novel video-specific perceptual quality metric using spatio-temporal features and a new dataset, improving correlation with human perception.
Findings
Outperforms state-of-the-art quality metrics on video frame interpolation evaluation.
Utilizes spatio-temporal modules based on Swin Transformer for better feature extraction.
Provides publicly available code and dataset for further research.
Abstract
Research on video frame interpolation has made significant progress in recent years. However, existing methods mostly use off-the-shelf metrics to measure the quality of interpolation results with the exception of a few methods that employ user studies, which is time-consuming. As video frame interpolation results often exhibit unique artifacts, existing quality metrics sometimes are not consistent with human perception when measuring the interpolation results. Some recent deep learning-based perceptual quality metrics are shown more consistent with human judgments, but their performance on videos is compromised since they do not consider temporal information. In this paper, we present a dedicated perceptual quality metric for measuring video frame interpolation results. Our method learns perceptual features directly from videos instead of individual frames. It compares pyramid features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Advanced Image Processing Techniques · Image Enhancement Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Label Smoothing · Softmax · Byte Pair Encoding · Stochastic Depth · Adam · Dense Connections · Absolute Position Encodings
