Weakly Supervised Semantic Segmentation using Web-Crawled Videos
Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, Bohyung Han

TL;DR
This paper introduces a weakly supervised semantic segmentation method that uses web-crawled videos to generate pseudo-labels, improving segmentation accuracy over existing weakly supervised methods without extra human annotation.
Contribution
It presents a novel approach that uses web videos and discriminative localization to generate effective segmentation labels from only image-level supervision.
Findings
Outperforms existing weakly supervised methods
Achieves results comparable to methods with additional annotations
Effectively identifies relevant spatio-temporal video segments
Abstract
We propose a novel algorithm for weakly supervised semantic segmentation based on image-level class labels only. In the weakly supervised setting, it is commonly observed that a trained model focuses too heavily on discriminative parts rather than the entire object area. Our goal is to overcome this limitation with no additional human intervention by retrieving videos relevant to the target class labels from a web repository and generating segmentation labels from the retrieved videos to simulate strong supervision for semantic segmentation. During this process, we take advantage of image classification with a discriminative localization technique to reject false alarms in the retrieved videos and to identify relevant spatio-temporal volumes within them. Although the entire procedure does not require any additional supervision, the segmentation annotations obtained from videos are sufficiently…
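The filtering step described in the abstract can be illustrated with a small sketch. It assumes a class-activation-map (CAM) style classifier, that is, a linear layer applied to globally average-pooled convolutional features; the abstract only says "discriminative localization", so this choice, along with the function names, the array shapes, and the `min_score` threshold, is an assumption made for illustration and not the authors' implementation.

```python
import numpy as np

def class_scores(features, weights):
    """Softmax class scores from (K, H, W) feature maps and (C, K) weights."""
    pooled = features.mean(axis=(1, 2))      # global average pooling -> (K,)
    logits = weights @ pooled                # (C,)
    exp = np.exp(logits - logits.max())      # subtract max for stability
    return exp / exp.sum()

def class_activation_map(features, weights, class_idx):
    """Weighted sum of feature maps for one class, scaled to [0, 1]."""
    cam = np.tensordot(weights[class_idx], features, axes=([0], [0]))  # (H, W)
    cam = np.maximum(cam, 0.0)               # keep positive evidence only
    peak = cam.max()
    return cam / peak if peak > 0 else cam

def select_relevant_frames(frame_features, weights, class_idx, min_score=0.5):
    """Keep frames the classifier assigns to the class; return (index, map) pairs.

    min_score is a hypothetical threshold chosen for this sketch.
    """
    kept = []
    for t, feats in enumerate(frame_features):
        if class_scores(feats, weights)[class_idx] >= min_score:
            kept.append((t, class_activation_map(feats, weights, class_idx)))
    return kept

# Toy example: two frames, two feature channels, two classes.
weights = np.eye(2)
frame_a = np.zeros((2, 2, 2)); frame_a[0, 0, 0] = 4.0   # evidence for class 0
frame_b = np.zeros((2, 2, 2)); frame_b[1, 1, 1] = 4.0   # evidence for class 1
kept = select_relevant_frames([frame_a, frame_b], weights, class_idx=0)
```

In this toy setup only the first frame survives for class 0, and its map peaks at the top-left location. In a full pipeline, such maps would serve as a coarse localization cue for the later step of generating segmentation labels from the retained video segments.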
Taxonomy
Topics: Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
