Zero-Shot Monocular Motion Segmentation in the Wild by Combining Deep   Learning with Geometric Motion Model Fusion

Yuxiang Huang; Yuhao Chen; John Zelek

arXiv:2405.01723·cs.CV·May 6, 2024

Zero-Shot Monocular Motion Segmentation in the Wild by Combining Deep Learning with Geometric Motion Model Fusion

Yuxiang Huang, Yuhao Chen, John Zelek

PDF

Open Access

TL;DR

This paper introduces a zero-shot monocular motion segmentation approach that combines deep learning with geometric model fusion, achieving state-of-the-art results without requiring training data.

Contribution

It presents a novel method that integrates deep learning and geometric models for motion segmentation, eliminating the need for extensive training datasets.

Findings

01

Achieves competitive results on multiple datasets.

02

Surpasses some supervised methods without training data.

03

Demonstrates the effectiveness of geometric model fusion.

Abstract

Detecting and segmenting moving objects from a moving monocular camera is challenging in the presence of unknown camera motion, diverse object motions and complex scene structures. Most existing methods rely on a single motion cue to perform motion segmentation, which is usually insufficient when facing different complex environments. While a few recent deep learning based methods are able to combine multiple motion cues to achieve improved accuracy, they depend heavily on vast datasets and extensive annotations, making them less adaptable to new scenarios. To address these limitations, we propose a novel monocular dense segmentation method that achieves state-of-the-art motion segmentation results in a zero-shot manner. The proposed method synergestically combines the strengths of deep learning and geometric model fusion methods by performing geometric model fusion on object proposals.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition