Learning to Better Segment Objects from Unseen Classes with Unlabeled Videos
Yuming Du, Yang Xiao, Vincent Lepetit

TL;DR
This paper presents a Bayesian approach to automatically generate training data from unlabeled videos, significantly improving segmentation of unseen object classes and enabling open-world instance segmentation.
Contribution
The paper introduces a novel Bayesian method that creates high-quality training sets from unlabeled videos for unseen class segmentation, outperforming existing video segmentation techniques.
Findings
Generated training sets improve unseen class segmentation performance
Method outperforms existing video segmentation approaches
Enables open-world instance segmentation using Internet videos
Abstract
The ability to localize and segment objects from unseen classes would open the door to new applications, such as autonomous object learning in active vision. Nonetheless, improving the performance on unseen classes requires additional training data, while manually annotating the objects of the unseen classes can be labor-extensive and expensive. In this paper, we explore the use of unlabeled video sequences to automatically generate training data for objects of unseen classes. It is in principle possible to apply existing video segmentation methods to unlabeled videos and automatically obtain object masks, which can then be used as a training set even for classes with no manual labels available. However, our experiments show that these methods do not perform well enough for this purpose. We therefore introduce a Bayesian method that is specifically designed to automatically create such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
