TL;DR
This paper evaluates the effectiveness of Bayesian deep learning models in out-of-distribution detection and adversarial robustness, revealing marginal improvements and highlighting factors influencing model sensitivity to unseen data.
Contribution
It systematically tests likelihood-based Bayesian models for OoD detection and adversarial robustness, uncovering their limitations and the impact of architectural choices.
Findings
Bayesian models sometimes outperform conventional neural networks in OoD detection.
Models show reduced AUC scores when in/out distributions have minimal overlap.
Initialisation, architecture, and activation functions influence model sensitivity to unseen data.
Abstract
Deep neural networks have been successful in diverse discriminative classification tasks, although, they are poorly calibrated often assigning high probability to misclassified predictions. Potential consequences could lead to trustworthiness and accountability of the models when deployed in real applications, where predictions are evaluated based on their confidence scores. Existing solutions suggest the benefits attained by combining deep neural networks and Bayesian inference to quantify uncertainty over the models' predictions for ambiguous datapoints. In this work we propose to validate and test the efficacy of likelihood based models in the task of out of distribution detection (OoD). Across different datasets and metrics we show that Bayesian deep learning models on certain occasions marginally outperform conventional neural networks and in the event of minimal overlap between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
