Can Image-Level Labels Replace Pixel-Level Labels for Image Parsing
Zhiwu Lu, Zhenyong Fu, Tao Xiang, Liwei Wang, and Ji-Rong Wen

TL;DR
This paper proposes a weakly supervised sparse learning method for image parsing using only noisy image-level labels, demonstrating that such labels can effectively replace pixel-level annotations.
Contribution
It introduces a novel sparse learning framework for noisily tagged image parsing that relies solely on image-level labels, reducing the need for pixel-level supervision.
Findings
Effective parsing with noisy image-level labels
Outperforms traditional pixel-label based methods
Provides insights into label supervision for image parsing
Abstract
This paper presents a weakly supervised sparse learning approach to the problem of noisily tagged image parsing, or segmenting all the objects within a noisily tagged image and identifying their categories (i.e. tags). Different from the traditional image parsing that takes pixel-level labels as strong supervisory information, our noisily tagged image parsing is provided with noisy tags of all the images (i.e. image-level labels), which is a natural setting for social image collections (e.g. Flickr). By oversegmenting all the images into regions, we formulate noisily tagged image parsing as a weakly supervised sparse learning problem over all the regions, where the initial labels of each region are inferred from image-level labels. Furthermore, we develop an efficient algorithm to solve such weakly supervised sparse learning problem. The experimental results on two benchmark datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Image Retrieval and Classification Techniques
