Skin the sheep not only once: Reusing Various Depth Datasets to Drive   the Learning of Optical Flow

Sheng-Chi Huang; Wei-Chen Chiu

arXiv:2310.01833·cs.CV·October 4, 2023

Skin the sheep not only once: Reusing Various Depth Datasets to Drive the Learning of Optical Flow

Sheng-Chi Huang, Wei-Chen Chiu

PDF

Open Access

TL;DR

This paper introduces a novel approach to improve optical flow estimation by reusing diverse real-world depth datasets through geometric transformations and augmentation, enhancing training data diversity and model robustness.

Contribution

It unifies various depth datasets for optical flow training by synthesizing virtual disparities and camera motions, and employs geometric augmentations with an auxiliary classifier to boost learning.

Findings

01

Enhanced optical flow accuracy across multiple datasets

02

Improved generalization of flow estimators with augmented training

03

Method is effective across various models and data sources

Abstract

Optical flow estimation is crucial for various applications in vision and robotics. As the difficulty of collecting ground truth optical flow in real-world scenarios, most of the existing methods of learning optical flow still adopt synthetic dataset for supervised training or utilize photometric consistency across temporally adjacent video frames to drive the unsupervised learning, where the former typically has issues of generalizability while the latter usually performs worse than the supervised ones. To tackle such challenges, we propose to leverage the geometric connection between optical flow estimation and stereo matching (based on the similarity upon finding pixel correspondences across images) to unify various real-world depth estimation datasets for generating supervised training data upon optical flow. Specifically, we turn the monocular depth datasets into stereo ones via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Advanced Image and Video Retrieval Techniques

MethodsAuxiliary Classifier