Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach
Yuxiang Huang, Yuhao Chen, John Zelek

TL;DR
This paper introduces a zero-shot dense motion segmentation method that combines optical flow, pseudo depth, and foundation models to effectively segment motion in monocular videos without training.
Contribution
It presents a hybrid approach that leverages traditional optical flow and monocular depth estimation, eliminating the need for training data in motion segmentation.
Findings
Outperforms existing unsupervised methods on benchmark datasets.
Closely matches the performance of supervised methods.
Effectively handles complex scenes with depth variations.
Abstract
Motion segmentation from a single moving camera presents a significant challenge in the field of computer vision. This challenge is compounded by the unknown camera movements and the lack of depth information of the scene. While deep learning has shown impressive capabilities in addressing these issues, supervised models require extensive training on massive annotated datasets, and unsupervised models also require training on large volumes of unannotated data, presenting significant barriers for both. In contrast, traditional methods based on optical flow do not require training data, however, they often fail to capture object-level information, leading to over-segmentation or under-segmentation. In addition, they also struggle in complex scenes with substantial depth variations and non-rigid motion, due to the overreliance of optical flow. To overcome these challenges, we propose an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Vision and Imaging · Image and Object Detection Techniques
