Exposing Semantic Segmentation Failures via Maximum Discrepancy Competition
Jiebin Yan, Yu Zhong, Yuming Fang, Zhangyang Wang, Kede Ma

TL;DR
This paper introduces a method to expose failures of semantic segmentation algorithms in real-world scenarios by selecting images that maximize the discrepancy between methods, revealing their weaknesses with minimal labeling effort.
Contribution
It proposes a discrepancy-based image sampling technique to identify and analyze the failure modes of semantic segmentation models in open-world conditions.
Findings
Identified strengths and weaknesses of ten PASCAL VOC segmentation algorithms.
Demonstrated the effectiveness of MAD in exposing model failures.
Provided insights for future semantic segmentation research.
Abstract
Semantic segmentation is an extensively studied task in computer vision, with numerous methods proposed every year. Thanks to the advent of deep learning in semantic segmentation, the performance on existing benchmarks is close to saturation. A natural question then arises: Does the superior performance on the closed (and frequently re-used) test sets transfer to the open visual world with unconstrained variations? In this paper, we take steps toward answering the question by exposing failures of existing semantic segmentation methods in the open visual world under the constraint of very limited human labeling effort. Inspired by previous research on model falsification, we start from an arbitrarily large image set, and automatically sample a small image set by MAximizing the Discrepancy (MAD) between two segmentation methods. The selected images have the greatest potential in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
