TL;DR
This paper introduces a unified unsupervised framework called Competitive Collaboration that jointly learns depth, camera motion, optical flow, and scene segmentation by leveraging geometric constraints, improving performance across all tasks.
Contribution
The paper presents a novel Competitive Collaboration framework that enables simultaneous unsupervised learning of multiple interconnected low-level vision tasks with explicit scene segmentation.
Findings
Achieves state-of-the-art results on all sub-problems in unsupervised settings.
Effectively segments scenes into static and moving regions.
Joint learning improves the accuracy of depth, motion, and flow estimation.
Abstract
We address the unsupervised learning of several interconnected problems in low-level vision: single view depth prediction, camera motion estimation, optical flow, and segmentation of a video into the static scene and moving regions. Our key insight is that these four fundamental vision problems are coupled through geometric constraints. Consequently, learning to solve them together simplifies the problem because the solutions can reinforce each other. We go beyond previous work by exploiting geometry more explicitly and segmenting the scene into static and moving regions. To that end, we introduce Competitive Collaboration, a framework that facilitates the coordinated training of multiple specialized neural networks to solve complex problems. Competitive Collaboration works much like expectation-maximization, but with neural networks that act as both competitors to explain pixels that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
