EM-driven unsupervised learning for efficient motion segmentation
Etienne Meunier, Ana\"is Badoual, and Patrick Bouthemy

TL;DR
This paper introduces an unsupervised CNN-based method for motion segmentation using an EM framework, capable of segmenting multiple motions efficiently without ground-truth data, and demonstrates strong results on multiple benchmarks.
Contribution
It proposes a novel EM-driven training approach for motion segmentation that does not require annotations and can handle multiple motions efficiently.
Findings
Performed well on DAVIS2016, SegTrackV2, FBMS59, and MoCA benchmarks.
Achieved fast inference without estimating motion models.
Introduced a robust loss function and data augmentation technique.
Abstract
In this paper, we present a CNN-based fully unsupervised method for motion segmentation from optical flow. We assume that the input optical flow can be represented as a piecewise set of parametric motion models, typically, affine or quadratic motion models. The core idea of our work is to leverage the Expectation-Maximization (EM) framework in order to design in a well-founded manner a loss function and a training procedure of our motion segmentation neural network that does not require either ground-truth or manual annotation. However, in contrast to the classical iterative EM, once the network is trained, we can provide a segmentation for any unseen optical flow field in a single inference step and without estimating any motion models. We investigate different loss functions including robust ones and propose a novel efficient data augmentation technique on the optical flow field,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Human Pose and Action Recognition
