Imposing Consistency for Optical Flow Estimation
Jisoo Jeong, Jamie Menjay Lin, Fatih Porikli, Nojun Kwak

TL;DR
This paper introduces novel consistency strategies for optical flow estimation, improving self-supervised and semi-supervised learning without additional annotations, and achieves state-of-the-art results on the KITTI-2015 benchmark.
Contribution
It proposes occlusion consistency and zero forcing techniques for optical flow, enhancing pixel motion description in a semi-supervised manner without extra labels.
Findings
Achieves state-of-the-art performance on KITTI-2015 scene flow benchmark.
Attains the best foreground accuracy (4.33% Fl-all) with monocular inputs.
Improves optical flow estimation through novel consistency strategies.
Abstract
Imposing consistency through proxy tasks has been shown to enhance data-driven learning and enable self-supervision in various tasks. This paper introduces novel and effective consistency strategies for optical flow estimation, a problem where labels from real-world data are very challenging to derive. More specifically, we propose occlusion consistency and zero forcing in the forms of self-supervised learning and transformation consistency in the form of semi-supervised learning. We apply these consistency techniques in a way that the network model learns to describe pixel-level motions better while requiring no additional annotations. We demonstrate that our consistency strategies applied to a strong baseline network model using the original datasets and labels provide further improvements, attaining the state-of-the-art results on the KITTI-2015 scene flow benchmark in the non-stereo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Advanced Image and Video Retrieval Techniques
