Exploring Motion Ambiguity and Alignment for High-Quality Video Frame Interpolation
Kun Zhou, Wenbo Li, Xiaoguang Han, Jiangbo Lu

TL;DR
This paper introduces a texture consistency loss and a cross-scale pyramid alignment module to improve video frame interpolation, addressing motion ambiguity and computational efficiency issues in existing methods.
Contribution
The paper proposes a texture consistency loss and an efficient cross-scale pyramid alignment module that improve the quality and speed of video frame interpolation (VFI), overcoming limitations of previous approaches.
Findings
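The texture consistency loss (TCL) improves interpolation quality when plugged into existing VFI frameworks.
The cross-scale pyramid alignment (CSPA) module reduces the computational complexity of alignment to O(N).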
Experimental results show enhanced performance and efficiency.
Abstract
For video frame interpolation (VFI), existing deep-learning-based approaches strongly rely on the ground-truth (GT) intermediate frames, which sometimes ignore the non-unique nature of motion judging from the given adjacent frames. As a result, these methods tend to produce averaged solutions that are not clear enough. To alleviate this issue, we propose to relax the requirement of reconstructing an intermediate frame as close to the GT as possible. Towards this end, we develop a texture consistency loss (TCL) upon the assumption that the interpolated content should maintain similar structures with their counterparts in the given frames. Predictions satisfying this constraint are encouraged, though they may differ from the pre-defined GT. Without the bells and whistles, our plug-and-play TCL is capable of improving the performance of existing VFI frameworks. On the other hand, previous…
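The idea behind the texture consistency loss can be illustrated with a small sketch. This is not the authors' implementation: the function names, the patch size, the search radius, and the use of a mean absolute difference on raw grayscale values are all assumptions chosen for illustration. The sketch scores a predicted frame by how well each of its patches matches some nearby patch in either input frame, rather than by its distance to a single ground-truth frame.

```python
def patch_at(img, top, left, size):
    """Flatten the size x size patch whose top-left corner is (top, left)."""
    return [img[top + dy][left + dx] for dy in range(size) for dx in range(size)]


def mean_abs_diff(a, b):
    """Mean absolute difference between two equally sized flat patches."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def texture_consistency_loss(pred, frames, patch=3, search=2):
    """Hypothetical TCL-style score (an assumption, not the paper's formula).

    For every non-overlapping patch in `pred`, find the closest patch within
    +/- `search` pixels in any of the input `frames`, and average the
    best-match distances over all patches.
    """
    height, width = len(pred), len(pred[0])
    if height < patch or width < patch:
        raise ValueError("image is smaller than the patch size")
    total, count = 0.0, 0
    for top in range(0, height - patch + 1, patch):
        for left in range(0, width - patch + 1, patch):
            target = patch_at(pred, top, left, patch)
            best = float("inf")
            for frame in frames:
                for dy in range(-search, search + 1):
                    for dx in range(-search, search + 1):
                        t, l = top + dy, left + dx
                        # Skip candidates that fall outside the frame.
                        if 0 <= t <= height - patch and 0 <= l <= width - patch:
                            cand = patch_at(frame, t, l, patch)
                            best = min(best, mean_abs_diff(target, cand))
            total += best
            count += 1
    return total / count


def make_frame(shift, height=6, width=8):
    """Toy frame: vertical stripes of period 4, moved right by `shift` pixels."""
    return [[1.0 if (x - shift) % 4 == 0 else 0.0 for x in range(width)]
            for _ in range(height)]


frame0, frame1 = make_frame(0), make_frame(2)
sharp = make_frame(1)  # one plausible, sharp intermediate frame
blurry = [[(a + b) / 2 for a, b in zip(r0, r1)] for r0, r1 in zip(frame0, frame1)]

sharp_loss = texture_consistency_loss(sharp, [frame0, frame1])
blurry_loss = texture_consistency_loss(blurry, [frame0, frame1])
```

In this toy setup the sharp prediction receives a loss of zero because each of its patches appears, shifted by one pixel, in one of the input frames, while the pixel-wise average of the two inputs is penalized because its half-intensity stripes match neither frame. That mirrors the abstract's motivation: averaged solutions are discouraged, and structure-preserving predictions are accepted even if they differ from one pre-defined GT.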
Taxonomy
Topics: Advanced Image Processing Techniques · Advanced Vision and Imaging · Image Processing Techniques and Applications
