Unsupervised Object Segmentation by Redrawing
Micka\"el Chen, Thierry Arti\`eres, Ludovic Denoyer

TL;DR
This paper introduces ReDO, an unsupervised model for object segmentation that extracts and redraws objects in images without requiring annotated masks, leveraging adversarial training to match the original dataset distribution.
Contribution
ReDO is a novel unsupervised object segmentation method that does not need pixel-level annotations, using adversarial learning to ensure realistic object extraction and redrawing.
Findings
Effective object masks extracted without supervision
High-quality redrawing of objects maintaining dataset distribution
Applicable across different datasets
Abstract
Object segmentation is a crucial problem that is usually solved by using supervised learning approaches over very large datasets composed of both images and corresponding object masks. Since the masks have to be provided at pixel level, building such a dataset for any new domain can be very time-consuming. We present ReDO, a new model able to extract objects from images without any annotation in an unsupervised way. It relies on the idea that it should be possible to change the textures or colors of the objects without changing the overall distribution of the dataset. Following this assumption, our approach is based on an adversarial architecture where the generator is guided by an input sample: given an image, it extracts the object mask, then redraws a new object at the same location. The generator is controlled by a discriminator that ensures that the distribution of generated images…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis
