Towards Closing the Gap in Weakly Supervised Semantic Segmentation with DCNNs: Combining Local and Global Models
Christoph Mayer, Radu Timofte, Gr\'egory Paul

TL;DR
This paper introduces a strategy to narrow the performance gap in weakly supervised semantic segmentation by combining local and global models, achieving state-of-the-art results with scribble annotations.
Contribution
It presents a novel approach that averages local predictions with baseline annotations, systematically reducing the performance gap in weakly supervised segmentation.
Findings
Achieved a mIoU of 75.6% without CRF post-processing.
Reduced the gap by 64.2%, surpassing current state-of-the-art.
Unveiled a simple, generalizable mechanism for weakly supervised learning.
Abstract
Generating training sets for deep convolutional neural networks (DCNNs) is a bottleneck for modern real-world applications. This is a demanding task for applications where annotating training data is costly, such as in semantic segmentation. In the literature, there is still a gap between the performance achieved by a network trained on full and on weak annotations. In this paper, we establish a strategy to measure this gap and to identify the ingredients necessary to reduce it. On scribbles, we establish new state-of-the-art results: we obtain a mIoU of 75.6% without, and 75.7% with CRF post-processing. We reduce the gap by 64.2% whereas the current state-of-the-art reduces it only by 57.5%. Thanks to a systematic study of the different ingredients involved in the weakly supervised scenario and an original experimental strategy, we unravel a counter-intuitive mechanism that is simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion-Convolutional Neural Networks · Conditional Random Field
