Uncertainty in AI: Evaluating Deep Neural Networks on Out-of-Distribution Images
Jamiu Idowu, Ahmed Almasoud

TL;DR
This paper evaluates the uncertainty and robustness of various deep neural networks on out-of-distribution and perturbed images, demonstrating ensemble methods improve classification accuracy but models remain vulnerable to perturbations.
Contribution
It introduces a comprehensive evaluation of DNNs' uncertainty on OOD data and proposes ensemble techniques to enhance classification performance.
Findings
Ensemble models outperform single models on OOD image classification.
ResNet-50 achieves the highest accuracy among tested models.
Models are highly vulnerable to image perturbations, misclassifying all perturbed images.
Abstract
As AI models are increasingly deployed in critical applications, ensuring the consistent performance of models when exposed to unusual situations such as out-of-distribution (OOD) or perturbed data, is important. Therefore, this paper investigates the uncertainty of various deep neural networks, including ResNet-50, VGG16, DenseNet121, AlexNet, and GoogleNet, when dealing with such data. Our approach includes three experiments. First, we used the pretrained models to classify OOD images generated via DALL-E to assess their performance. Second, we built an ensemble from the models' predictions using probabilistic averaging for consensus due to its advantages over plurality or majority voting. The ensemble's uncertainty was quantified using average probabilities, variance, and entropy metrics. Our results showed that while ResNet-50 was the most accurate single model for OOD images, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
