DS-UI: Dual-Supervised Mixture of Gaussian Mixture Models for Uncertainty Inference
Jiyang Xie, Zhanyu Ma, Jing-Hao Xue, Guoqiang Zhang, Jun, Guo

TL;DR
This paper introduces DS-UI, a dual-supervised framework combining Gaussian mixture models with deep neural networks to improve uncertainty inference, especially in out-of-domain detection, outperforming existing methods.
Contribution
The paper presents a novel MoGMM-FC layer and a dual-supervised variational Bayes algorithm for enhanced uncertainty estimation in DNNs.
Findings
Outperforms state-of-the-art UI methods in misclassification detection
Significant improvements in open-set out-of-domain detection
Visualizations confirm superior feature space separation
Abstract
This paper proposes a dual-supervised uncertainty inference (DS-UI) framework for improving Bayesian estimation-based uncertainty inference (UI) in deep neural network (DNN)-based image recognition. In the DS-UI, we combine the classifier of a DNN, i.e., the last fully-connected (FC) layer, with a mixture of Gaussian mixture models (MoGMM) to obtain an MoGMM-FC layer. Unlike existing UI methods for DNNs, which only calculate the means or modes of the DNN outputs' distributions, the proposed MoGMM-FC layer acts as a probabilistic interpreter for the features that are inputs of the classifier to directly calculate the probability density of them for the DS-UI. In addition, we propose a dual-supervised stochastic gradient-based variational Bayes (DS-SGVB) algorithm for the MoGMM-FC layer optimization. Unlike conventional SGVB and optimization algorithms in other UI methods, the DS-SGVB not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Gaussian Processes and Bayesian Inference
