Video Frame Interpolation via Structure-Motion based Iterative Fusion

Xi Li; Meng Cao; Yingying Tang; Scott Johnston; Zhendong Hong; Huimin Ma; Jiulong Shan

arXiv:2105.05353·cs.CV·May 13, 2021·1 cites

TL;DR

This paper introduces a novel structure-motion iterative fusion approach for video frame interpolation, combining the strengths of optical flow and kernel-based methods to produce sharper, more accurate interpolated frames with enhanced structural and motion consistency.

Contribution

It proposes an end-to-end learnable framework that fuses structure-based and motion-based interpolation with iterative refinement and saliency-aware evaluation, improving performance with less training data.

Findings

1. Outperforms state-of-the-art methods on three benchmarks.

2. Achieves superior metrics with only one-tenth of the training data.

3. Incorporates saliency masks to better handle foreground and background objects.

Abstract

Video frame interpolation synthesizes non-existent images between adjacent frames, with the aim of providing a smooth and consistent visual experience. Two approaches to this challenging task are optical-flow-based and kernel-based methods. In existing work, optical-flow-based methods provide an accurate point-to-point motion description but lack constraints on object structure. In contrast, kernel-based methods focus on structural alignment, which relies on semantic and appearance features, but tend to blur results. Based on these observations, we propose a structure-motion based iterative fusion method. The framework is an end-to-end learnable structure with two stages. First, interpolated frames are synthesized by structure-based and motion-based learning branches, respectively; then, an iterative refinement module is established via spatial and temporal…
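To make the contrast between the two families concrete, here is a minimal NumPy sketch on a toy grayscale pair. It is an illustration only, not the paper's network: the function names (`warp_with_flow`, `apply_local_kernels`), the hand-set flow fields, and the uniform 3x3 kernels are all assumptions standing in for quantities a real model would learn.

```python
import numpy as np

def warp_with_flow(frame, flow):
    """Motion-based synthesis: backward-warp `frame` (H, W) with a per-pixel
    flow (H, W, 2) holding (dx, dy), using nearest-neighbour sampling."""
    h, w = frame.shape
    ys, xs = np.mgrid[0:h, 0:w]
    src_x = np.clip(np.rint(xs + flow[..., 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + flow[..., 1]).astype(int), 0, h - 1)
    return frame[src_y, src_x]

def apply_local_kernels(frame, kernels):
    """Kernel-based synthesis: each output pixel is a weighted sum of its
    k x k neighbourhood, with one kernel per pixel, shape (H, W, k, k)."""
    h, w = frame.shape
    k = kernels.shape[-1]
    pad = k // 2
    padded = np.pad(frame, pad, mode="edge")
    out = np.zeros((h, w), dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += kernels[..., dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out

# Toy input: a 2x2 bright square that moves 2 pixels to the right.
frame0 = np.zeros((6, 8))
frame0[2:4, 2:4] = 1.0
frame1 = np.zeros((6, 8))
frame1[2:4, 4:6] = 1.0

# Ground-truth midpoint frame: the square has moved 1 pixel.
expected_mid = np.zeros((6, 8))
expected_mid[2:4, 3:5] = 1.0

# Hand-set flows from the midpoint back to each input (assumed, not estimated).
flow_to_0 = np.zeros((6, 8, 2))
flow_to_0[..., 0] = -1.0
flow_to_1 = np.zeros((6, 8, 2))
flow_to_1[..., 0] = 1.0
flow_mid = 0.5 * (warp_with_flow(frame0, flow_to_0) + warp_with_flow(frame1, flow_to_1))

# Uniform 3x3 kernels as a crude stand-in for learned per-pixel kernels.
uniform = np.full((6, 8, 3, 3), 1.0 / 9.0)
kernel_mid = 0.5 * (apply_local_kernels(frame0, uniform) + apply_local_kernels(frame1, uniform))
```

With a correct flow, the warped result reproduces the sharp midpoint frame exactly, whereas the uniform kernels spread the square over a wider, dimmer region. Learned kernels would do far better than this uniform stand-in; the sketch only shows why a weighted neighbourhood sum can soften edges where a point-to-point warp does not.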

Taxonomy

Topics: Advanced Vision and Imaging · Advanced Image Processing Techniques · Image Processing Techniques and Applications