RAFT: Recurrent All-Pairs Field Transforms for Optical Flow

Zachary Teed; Jia Deng

arXiv:2003.12039·cs.CV·August 26, 2020·137 cites

RAFT: Recurrent All-Pairs Field Transforms for Optical Flow

Zachary Teed, Jia Deng

PDF

Open Access 5 Repos 2 Models

TL;DR

RAFT introduces a novel recurrent deep network architecture for optical flow that builds multi-scale correlation volumes and iteratively refines flow estimates, achieving state-of-the-art accuracy and efficiency.

Contribution

The paper presents RAFT, a new architecture that significantly improves optical flow estimation by using all-pairs correlation volumes and recurrent updates.

Findings

01

Achieves 5.10% F1-all error on KITTI, 16% better than previous best.

02

Obtains 2.855 pixel end-point-error on Sintel, 30% better than previous best.

03

Demonstrates strong cross-dataset generalization and high efficiency.

Abstract

We introduce Recurrent All-Pairs Field Transforms (RAFT), a new deep network architecture for optical flow. RAFT extracts per-pixel features, builds multi-scale 4D correlation volumes for all pairs of pixels, and iteratively updates a flow field through a recurrent unit that performs lookups on the correlation volumes. RAFT achieves state-of-the-art performance. On KITTI, RAFT achieves an F1-all error of 5.10%, a 16% error reduction from the best published result (6.10%). On Sintel (final pass), RAFT obtains an end-point-error of 2.855 pixels, a 30% error reduction from the best published result (4.098 pixels). In addition, RAFT has strong cross-dataset generalization as well as high efficiency in inference time, training speed, and parameter count. Code is available at https://github.com/princeton-vl/RAFT.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques