Uncertainty Quantification for Large-Scale Deep Networks via Post-StoNet Modeling
Yan Sun, Faming Liang

TL;DR
This paper introduces a post-processing method using stochastic neural networks to quantify uncertainty in large-scale deep neural networks, providing valid and shorter confidence intervals with better calibration.
Contribution
It proposes a novel post-hoc uncertainty quantification approach with theoretical guarantees, bridging sparse learning theory from linear models to deep neural networks.
Findings
Constructs honest confidence intervals with shorter lengths than conformal methods.
Achieves better calibration compared to other post-hoc techniques.
Provides a theoretical guarantee for the method's validity.
Abstract
Deep learning has revolutionized modern data science. However, how to accurately quantify the uncertainty of predictions from large-scale deep neural networks (DNNs) remains an unresolved issue. To address this issue, we introduce a novel post-processing approach. This approach feeds the output from the last hidden layer of a pre-trained large-scale DNN model into a stochastic neural network (StoNet), then trains the StoNet with a sparse penalty on a validation dataset and constructs prediction intervals for future observations. We establish a theoretical guarantee for the validity of this approach; in particular, the parameter estimation consistency for the sparse StoNet is essential for the success of this approach. Comprehensive experiments demonstrate that the proposed approach can construct honest confidence intervals with shorter interval lengths compared to conformal methods and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference · Advanced Neural Network Applications
