Learning to Better Segment Objects from Unseen Classes with Unlabeled Videos

Yuming Du; Yang Xiao; Vincent Lepetit

arXiv:2104.12276·cs.CV·August 24, 2021

Open Access

TL;DR

This paper presents a Bayesian approach to automatically generate training data from unlabeled videos, significantly improving segmentation of unseen object classes and enabling open-world instance segmentation.

Contribution

The paper introduces a novel Bayesian method that creates high-quality training sets from unlabeled videos for unseen class segmentation, outperforming existing video segmentation techniques.

Findings

01

Generated training sets improve unseen class segmentation performance

02

Method outperforms existing video segmentation approaches

03

Enables open-world instance segmentation using Internet videos

Abstract

The ability to localize and segment objects from unseen classes would open the door to new applications, such as autonomous object learning in active vision. Nonetheless, improving performance on unseen classes requires additional training data, and manually annotating objects of the unseen classes can be labor-intensive and expensive. In this paper, we explore the use of unlabeled video sequences to automatically generate training data for objects of unseen classes. It is in principle possible to apply existing video segmentation methods to unlabeled videos and automatically obtain object masks, which can then be used as a training set even for classes with no manual labels available. However, our experiments show that these methods do not perform well enough for this purpose. We therefore introduce a Bayesian method that is specifically designed to automatically create such…

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques