Generalizable Implicit Motion Modeling for Video Frame Interpolation

Zujin Guo; Wei Li; Chen Change Loy

arXiv:2407.08680·cs.CV·February 11, 2025

Generalizable Implicit Motion Modeling for Video Frame Interpolation

Zujin Guo, Wei Li, Chen Change Loy

PDF

Open Access

TL;DR

This paper introduces GIMM, a novel motion modeling approach for video frame interpolation that effectively captures spatiotemporal dynamics by encoding motion priors and predicting flows with neural networks, outperforming current methods.

Contribution

GIMM is a new implicit motion modeling paradigm that encodes motion priors and predicts arbitrary-timestep flows, enhancing VFI performance.

Findings

01

GIMM outperforms state-of-the-art methods on standard benchmarks.

02

GIMM effectively models complex spatiotemporal motion dynamics.

03

The approach integrates seamlessly with existing flow-based VFI models.

Abstract

Motion modeling is critical in flow-based Video Frame Interpolation (VFI). Existing paradigms either consider linear combinations of bidirectional flows or directly predict bilateral flows for given timestamps without exploring favorable motion priors, thus lacking the capability of effectively modeling spatiotemporal dynamics in real-world videos. To address this limitation, in this study, we introduce Generalizable Implicit Motion Modeling (GIMM), a novel and effective approach to motion modeling for VFI. Specifically, to enable GIMM as an effective motion modeling paradigm, we design a motion encoding pipeline to model spatiotemporal motion latent from bidirectional flows extracted from pre-trained flow estimators, effectively representing input-specific motion priors. Then, we implicitly predict arbitrary-timestep optical flows within two adjacent input frames via an adaptive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Video Coding and Compression Technologies