How Low Can We Go? Pixel Annotation for Semantic Segmentation
Daniel Kigli, Ariel Shamir, Shai Avidan

TL;DR
This paper demonstrates that for semantic segmentation, less than 0.1% of pixel annotations per image are needed to train a highly accurate model, significantly reducing labeling effort.
Contribution
It introduces a method to train semantic segmentation models with minimal pixel annotations using active learning and pseudo-labeling, reducing annotation costs.
Findings
Less than 0.1% pixel annotation needed per image
Models achieve over 98% accuracy with minimal labels
Significant reduction in annotation effort for large datasets
Abstract
How many labeled pixels are needed to segment an image, without any prior knowledge? We conduct an experiment to answer this question. In our experiment, an Oracle is using Active Learning to train a network from scratch. The Oracle has access to the entire label map of the image, but the goal is to reveal as little pixel labels to the network as possible. We find that, on average, the Oracle needs to reveal (i.e., annotate) less than 0.1% of the pixels in order to train a network. The network can then label all pixels in the image at an accuracy of more than 98%. Based on this single-image-annotation experiment, we design an experiment to quickly annotate an entire data set. In the data set level experiment the Oracle trains a new network for each image from scratch. The network can then be used to create pseudo-labels, which are the network predicted labels of the unlabeled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Advanced Image and Video Retrieval Techniques
