TTVFI: Learning Trajectory-Aware Transformer for Video Frame   Interpolation

Chengxu Liu; Huan Yang; Jianlong Fu; Xueming Qian

arXiv:2207.09048·cs.CV·July 20, 2022·1 cites

TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation

Chengxu Liu, Huan Yang, Jianlong Fu, Xueming Qian

PDF

Open Access

TL;DR

This paper introduces TTVFI, a trajectory-aware transformer that improves video frame interpolation by better aligning features along motion trajectories, leading to more accurate and less blurred intermediate frames.

Contribution

The paper presents a novel transformer-based approach that models motion trajectories for improved feature alignment in video frame interpolation.

Findings

01

Outperforms state-of-the-art methods on four benchmarks.

02

Effectively handles large and complex motions.

03

Produces sharper, more accurate intermediate frames.

Abstract

Video frame interpolation (VFI) aims to synthesize an intermediate frame between two consecutive frames. State-of-the-art approaches usually adopt a two-step solution, which includes 1) generating locally-warped pixels by flow-based motion estimations, 2) blending the warped pixels to form a full frame through deep neural synthesis networks. However, due to the inconsistent warping from the two consecutive frames, the warped features for new frames are usually not aligned, which leads to distorted and blurred frames, especially when large and complex motions occur. To solve this issue, in this paper we propose a novel Trajectory-aware Transformer for Video Frame Interpolation (TTVFI). In particular, we formulate the warped features with inconsistent motions as query tokens, and formulate relevant regions in a motion trajectory from two original consecutive frames into keys and values.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Processing Techniques and Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Dropout · Adam · Residual Connection · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Label Smoothing