# FickleNet: Weakly and Semi-supervised Semantic Image Segmentation using   Stochastic Inference

**Authors:** Jungbeom Lee, Eunji Kim, Sungmin Lee, Jangho Lee, Sungroh Yoon

arXiv: 1902.10421 · 2019-03-05

## TL;DR

FickleNet introduces a stochastic inference method that enhances weakly and semi-supervised semantic segmentation by generating diverse localization maps from a single network without extra training, outperforming existing methods.

## Contribution

It proposes a novel stochastic inference approach that improves localization in weakly supervised segmentation by implicitly learning object coherence, without additional training.

## Key findings

- Outperforms recent techniques on Pascal VOC 2012 benchmark
- Generates diverse localization maps from a single network
- Does not require extra training steps

## Abstract

The main obstacle to weakly supervised semantic image segmentation is the difficulty of obtaining pixel-level information from coarse image-level annotations. Most methods based on image-level annotations use localization maps obtained from the classifier, but these only focus on the small discriminative parts of objects and do not capture precise boundaries. FickleNet explores diverse combinations of locations on feature maps created by generic deep neural networks. It selects hidden units randomly and then uses them to obtain activation scores for image classification. FickleNet implicitly learns the coherence of each location in the feature maps, resulting in a localization map which identifies both discriminative and other parts of objects. The ensemble effects are obtained from a single network by selecting random hidden unit pairs, which means that a variety of localization maps are generated from a single image. Our approach does not require any additional training steps and only adds a simple layer to a standard convolutional neural network; nevertheless it outperforms recent comparable techniques on the Pascal VOC 2012 benchmark in both weakly and semi-supervised settings.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.10421/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1902.10421/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/1902.10421/full.md

---
Source: https://tomesphere.com/paper/1902.10421