TL;DR
This study shows that using coarse ground truth annotations can maintain or even improve semantic image segmentation accuracy for certain classes in urban scene datasets, potentially simplifying dataset preparation.
Contribution
The paper demonstrates that coarse annotations can be effectively used for training and prediction in semantic segmentation, reducing annotation effort without sacrificing accuracy.
Findings
Coarse GT annotations can outperform fine annotations for key classes.
Standard deviation of accuracy is lower with coarse annotations, indicating more consistent performance.
Using coarse annotations enables faster dataset preparation and model tuning.
Abstract
Preparation of high-quality datasets for the urban scene understanding is a labor-intensive task, especially, for datasets designed for the autonomous driving applications. The application of the coarse ground truth (GT) annotations of these datasets without detriment to the accuracy of semantic image segmentation (by the mean intersection over union - mIoU) could simplify and speedup the dataset preparation and model fine tuning before its practical application. Here the results of the comparative analysis for semantic segmentation accuracy obtained by PSPNet deep learning architecture are presented for fine and coarse annotated images from Cityscapes dataset. Two scenarios were investigated: scenario 1 - the fine GT images for training and prediction, and scenario 2 - the fine GT images for training and the coarse GT images for prediction. The obtained results demonstrated that for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Convolution · Average Pooling · Pyramid Pooling Module · Auxiliary Classifier · Dilated Convolution · PSPNet
