Enhancing Uncertainty Estimation and Interpretability via Bayesian Non-negative Decision Layer
Xinyue Hu, Zhibin Duan, Bo Chen, Mingyuan Zhou

TL;DR
This paper introduces a Bayesian non-negative decision layer that enhances uncertainty estimation and interpretability in neural networks by modeling complex dependencies and learning disentangled representations.
Contribution
The paper proposes a novel Bayesian non-negative decision layer that improves uncertainty estimation and interpretability through disentangled representations and theoretical guarantees.
Findings
Improved uncertainty estimation accuracy
Enhanced interpretability via disentangled features
Better model performance with the proposed layer
Abstract
Although deep neural networks have demonstrated significant success due to their powerful expressiveness, most models struggle to meet practical requirements for uncertainty estimation. Concurrently, the entangled nature of deep neural networks leads to a multifaceted problem, where various localized explanation techniques reveal that multiple unrelated features influence the decisions, thereby undermining interpretability. To address these challenges, we develop a Bayesian Non-negative Decision Layer (BNDL), which reformulates deep neural networks as a conditional Bayesian non-negative factor analysis. By leveraging stochastic latent variables, the BNDL can model complex dependencies and provide robust uncertainty estimation. Moreover, the sparsity and non-negativity of the latent variables encourage the model to learn disentangled representations and decision layers, thereby improving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
