Self-SuperFlow: Self-supervised Scene Flow Prediction in Stereo Sequences
Katharina Bendig, Ren\'e Schuster, Didier Stricker

TL;DR
Self-SelfFlow introduces a self-supervised approach for scene flow prediction in stereo sequences, leveraging unlabeled real-world data to outperform supervised methods on the KITTI benchmark and enhance generalization.
Contribution
It extends self-supervised loss functions with Census transform and occlusion awareness for scene flow, reducing reliance on dense annotations and improving performance.
Findings
Outperforms supervised pre-training on KITTI
Shows better generalization to real-world data
Achieves faster convergence during training
Abstract
In recent years, deep neural networks showed their exceeding capabilities in addressing many computer vision tasks including scene flow prediction. However, most of the advances are dependent on the availability of a vast amount of dense per pixel ground truth annotations, which are very difficult to obtain for real life scenarios. Therefore, synthetic data is often relied upon for supervision, resulting in a representation gap between the training and test data. Even though a great quantity of unlabeled real world data is available, there is a huge lack in self-supervised methods for scene flow prediction. Hence, we explore the extension of a self-supervised loss based on the Census transform and occlusion-aware bidirectional displacements for the problem of scene flow prediction. Regarding the KITTI scene flow benchmark, our method outperforms the corresponding supervised pre-training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques
MethodsTest
