# Predicting Foreground Object Ambiguity and Efficiently Crowdsourcing the   Segmentation(s)

**Authors:** Danna Gurari, Kun He, Bo Xiong, Jianming Zhang, Mehrnoosh, Sameki, Suyog Dutt Jain, Stan Sclaroff, Margrit Betke, Kristen, Grauman

arXiv: 1705.00366 · 2017-05-02

## TL;DR

This paper introduces the ambiguity problem in foreground object segmentation, creating a new dataset and a system to predict ambiguous images, which improves annotation efficiency and reduces crowdsourcing effort without losing segmentation diversity.

## Contribution

The paper defines foreground ambiguity, constructs the STATIC dataset, and develops a predictive system to identify ambiguous images, enhancing crowdsourcing efficiency.

## Key findings

- The prediction system outperforms saliency-based methods.
- It reduces crowdsourcing effort by up to 47%.
- It maintains segmentation diversity while saving costs.

## Abstract

We propose the ambiguity problem for the foreground object segmentation task and motivate the importance of estimating and accounting for this ambiguity when designing vision systems. Specifically, we distinguish between images which lead multiple annotators to segment different foreground objects (ambiguous) versus minor inter-annotator differences of the same object. Taking images from eight widely used datasets, we crowdsource labeling the images as "ambiguous" or "not ambiguous" to segment in order to construct a new dataset we call STATIC. Using STATIC, we develop a system that automatically predicts which images are ambiguous. Experiments demonstrate the advantage of our prediction system over existing saliency-based methods on images from vision benchmarks and images taken by blind people who are trying to recognize objects in their environment. Finally, we introduce a crowdsourcing system to achieve cost savings for collecting the diversity of all valid "ground truth" foreground object segmentations by collecting extra segmentations only when ambiguity is expected. Experiments show our system eliminates up to 47% of human effort compared to existing crowdsourcing methods with no loss in capturing the diversity of ground truths.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.00366/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1705.00366/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/1705.00366/full.md

---
Source: https://tomesphere.com/paper/1705.00366