IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
Lingtong Kong, Boyuan Jiang, Donghao Luo, Wenqing Chu, Xiaoming Huang,, Ying Tai, Chengjie Wang, Jie Yang

TL;DR
IFRNet is an efficient encoder-decoder network for real-time video frame interpolation, utilizing pyramid features, intermediate flow refinement, and novel loss functions to achieve high performance and speed.
Contribution
The paper introduces IFRNet, a novel fast frame interpolation network with a unique feature refinement process and task-oriented flow distillation, improving efficiency and accuracy.
Findings
Achieves state-of-the-art performance on benchmarks.
Offers fast inference suitable for real-time applications.
Effectively preserves structural details through regularization.
Abstract
Prevailing video frame interpolation algorithms, that generate the intermediate frames from consecutive inputs, typically rely on complex model architectures with heavy parameters or large delay, hindering them from diverse real-time applications. In this work, we devise an efficient encoder-decoder based network, termed IFRNet, for fast intermediate frame synthesizing. It first extracts pyramid features from given inputs, and then refines the bilateral intermediate flow fields together with a powerful intermediate feature until generating the desired output. The gradually refined intermediate feature can not only facilitate intermediate flow estimation, but also compensate for contextual details, making IFRNet do not need additional synthesis or refinement module. To fully release its potential, we further propose a novel task-oriented optical flow distillation loss to focus on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Processing Techniques and Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
