What's the Point: Semantic Segmentation with Point Supervision
Amy Bearman, Olga Russakovsky, Vittorio Ferrari, Li Fei-Fei

TL;DR
This paper explores using point-level annotations for semantic segmentation, combining them with an objectness potential in training CNNs, resulting in improved accuracy over weaker supervision methods.
Contribution
It introduces a novel training loss incorporating point supervision and objectness, demonstrating significant accuracy gains on PASCAL VOC 2012.
Findings
12.9% mIOU improvement over image-level supervision
Point supervision outperforms squiggle and full supervision at fixed annotation budget
Models trained with point supervision achieve higher accuracy
Abstract
The semantic image segmentation task presents a trade-off between test time accuracy and training-time annotation cost. Detailed per-pixel annotations enable training accurate models but are very time-consuming to obtain, image-level class labels are an order of magnitude cheaper but result in less accurate models. We take a natural step from image-level annotation towards stronger supervision: we ask annotators to point to an object if one exists. We incorporate this point supervision along with a novel objectness potential in the training loss function of a CNN model. Experimental results on the PASCAL VOC 2012 benchmark reveal that the combined effect of point-level supervision and objectness potential yields an improvement of 12.9% mIOU over image-level supervision. Further, we demonstrate that models trained with point-level supervision are more accurate than models trained with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
Methods1-Dimensional Convolutional Neural Networks
