Bayesian Deep Learning Hyperparameter Search for Robust Function Mapping to Polynomials with Noise
Nidhin Harilal, Udit Bhatia, Auroop R. Ganguly

TL;DR
This paper investigates how to optimize Bayesian Deep Learning hyperparameters, such as depth and ensemble size, for robustly mapping noisy polynomial functions and quantifying uncertainty, revealing optimal configurations for different noise conditions.
Contribution
It provides empirical insights into selecting neural architecture hyperparameters for noise-robust polynomial function mapping with uncertainty quantification in Bayesian Deep Learning.
Findings
Optimal network depth exists for prediction and uncertainty quantification.
Ensemble size influences uncertainty estimation accuracy.
Width has limited impact on performance at high values.
Abstract
Advances in neural architecture search, as well as explainability and interpretability of connectionist architectures, have been reported in the recent literature. However, our understanding of how to design Bayesian Deep Learning (BDL) hyperparameters, specifically, the depth, width and ensemble size, for robust function mapping with uncertainty quantification, is still emerging. This paper attempts to further our understanding by mapping Bayesian connectionist representations to polynomials of different orders with varying noise types and ratios. We examine the noise-contaminated polynomials to search for the combination of hyperparameters that can extract the underlying polynomial signals while quantifying uncertainties based on the noise attributes. Specifically, we attempt to study the question that an appropriate neural architecture and ensemble configuration can be found to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning and Data Classification · Model Reduction and Neural Networks
