Skin the sheep not only once: Reusing Various Depth Datasets to Drive the Learning of Optical Flow
Sheng-Chi Huang, Wei-Chen Chiu

TL;DR
This paper introduces a novel approach to improve optical flow estimation by reusing diverse real-world depth datasets through geometric transformations and augmentation, enhancing training data diversity and model robustness.
Contribution
It unifies various depth datasets for optical flow training by synthesizing virtual disparities and camera motions, and employs geometric augmentations with an auxiliary classifier to boost learning.
Findings
Enhanced optical flow accuracy across multiple datasets
Improved generalization of flow estimators with augmented training
Method is effective across various models and data sources
Abstract
Optical flow estimation is crucial for various applications in vision and robotics. As the difficulty of collecting ground truth optical flow in real-world scenarios, most of the existing methods of learning optical flow still adopt synthetic dataset for supervised training or utilize photometric consistency across temporally adjacent video frames to drive the unsupervised learning, where the former typically has issues of generalizability while the latter usually performs worse than the supervised ones. To tackle such challenges, we propose to leverage the geometric connection between optical flow estimation and stereo matching (based on the similarity upon finding pixel correspondences across images) to unify various real-world depth estimation datasets for generating supervised training data upon optical flow. Specifically, we turn the monocular depth datasets into stereo ones via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Advanced Image and Video Retrieval Techniques
MethodsAuxiliary Classifier
