Semantic Segmentation by Semantic Proportions
Halil Ibrahim Aysel, Xiaohao Cai, Adam Pr\"ugel-Bennett

TL;DR
This paper introduces a novel semantic segmentation approach that uses rough class proportion information instead of detailed pixel-level annotations, significantly reducing annotation effort and storage needs while maintaining competitive performance.
Contribution
The paper proposes a new method leveraging semantic proportions for segmentation, enabling training with less detailed annotations and serving as a plug-and-play module for existing models.
Findings
Achieves competitive segmentation performance with semantic proportions.
Reduces annotation time, cost, and storage space.
Can enhance existing segmentation models without added complexity.
Abstract
Semantic segmentation is a critical task in computer vision aiming to identify and classify individual pixels in an image, with numerous applications in for example autonomous driving and medical image analysis. However, semantic segmentation can be highly challenging particularly due to the need for large amounts of annotated data. Annotating images is a time-consuming and costly process, often requiring expert knowledge and significant effort; moreover, saving the annotated images could dramatically increase the storage space. In this paper, we propose a novel approach for semantic segmentation, requiring the rough information of individual semantic class proportions, shortened as semantic proportions, rather than the necessity of ground-truth segmentation maps. This greatly simplifies the data annotation process and thus will significantly reduce the annotation time, cost and storage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
