What's the Point: Semantic Segmentation with Point Supervision

Amy Bearman; Olga Russakovsky; Vittorio Ferrari; Li Fei-Fei

arXiv:1506.02106·cs.CV·July 26, 2016·47 cites

What's the Point: Semantic Segmentation with Point Supervision

Amy Bearman, Olga Russakovsky, Vittorio Ferrari, Li Fei-Fei

PDF

Open Access 1 Repo

TL;DR

This paper explores using point-level annotations for semantic segmentation, combining them with an objectness potential in training CNNs, resulting in improved accuracy over weaker supervision methods.

Contribution

It introduces a novel training loss incorporating point supervision and objectness, demonstrating significant accuracy gains on PASCAL VOC 2012.

Findings

01

12.9% mIOU improvement over image-level supervision

02

Point supervision outperforms squiggle and full supervision at fixed annotation budget

03

Models trained with point supervision achieve higher accuracy

Abstract

The semantic image segmentation task presents a trade-off between test time accuracy and training-time annotation cost. Detailed per-pixel annotations enable training accurate models but are very time-consuming to obtain, image-level class labels are an order of magnitude cheaper but result in less accurate models. We take a natural step from image-level annotation towards stronger supervision: we ask annotators to point to an object if one exists. We incorporate this point supervision along with a novel objectness potential in the training loss function of a CNN model. Experimental results on the PASCAL VOC 2012 benchmark reveal that the combined effect of point-level supervision and objectness potential yields an improvement of 12.9% mIOU over image-level supervision. Further, we demonstrate that models trained with point-level supervision are more accurate than models trained with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

abearman/whats-the-point1
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

Methods1-Dimensional Convolutional Neural Networks