Discovering Objects that Can Move
Zhipeng Bao, Pavel Tokmakov, Allan Jabri, Yu-Xiong Wang, Adrien, Gaidon, Martial Hebert

TL;DR
This paper presents a scalable unsupervised method for object discovery in complex scenes by leveraging motion cues, focusing on dynamic objects to improve segmentation without manual labels.
Contribution
It introduces a simplified auto-encoder framework augmented with motion segmentation signals, enabling effective discovery of moving and static dynamic objects in real-world scenes.
Findings
Outperforms heuristic and learning-based methods on KITTI benchmark.
Successfully scales to complex street driving scenarios.
Generalizes from moving objects to static instances.
Abstract
This paper studies the problem of object discovery -- separating objects from the background without manual labels. Existing approaches utilize appearance cues, such as color, texture, and location, to group pixels into object-like regions. However, by relying on appearance alone, these methods fail to separate objects from the background in cluttered scenes. This is a fundamental limitation since the definition of an object is inherently ambiguous and context-dependent. To resolve this ambiguity, we choose to focus on dynamic objects -- entities that can move independently in the world. We then scale the recent auto-encoder based frameworks for unsupervised object discovery from toy synthetic images to complex real-world scenes. To this end, we simplify their architecture, and augment the resulting model with a weak learning signal from general motion segmentation algorithms. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Advanced Vision and Imaging
