Detection and Mitigation of Rare Subclasses in Deep Neural Network Classifiers
Colin Paterson, Radu Calinescu, Chiara Picardi

TL;DR
This paper introduces a new method for detecting and mitigating rare subclasses in deep neural network classifiers, improving their robustness and interpretability in high-dimensional, underrepresented regions of input space.
Contribution
The authors propose a novel commonality metric and methods for reducing the impact of rare subclasses during training and inference, enhancing classifier performance and reliability.
Findings
Successfully identified rare subclasses in MNIST and Kaggle datasets.
Produced models that compensate for subclass rarity.
Enhanced run-time detection of likely misclassifications.
Abstract
Regions of high-dimensional input spaces that are underrepresented in training datasets reduce machine-learnt classifier performance, and may lead to corner cases and unwanted bias for classifiers used in decision making systems. When these regions belong to otherwise well-represented classes, their presence and negative impact are very hard to identify. We propose an approach for the detection and mitigation of such rare subclasses in deep neural network classifiers. The new approach is underpinned by an easy-to-compute commonality metric that supports the detection of rare subclasses, and comprises methods for reducing the impact of these subclasses during both model training and model exploitation. We demonstrate our approach using two well-known datasets, MNIST's handwritten digits and Kaggle's cats/dogs, identifying rare subclasses and producing models which compensate for subclass…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
