From Overfitting to Reliability: Introducing the Hierarchical Approximate Bayesian Neural Network

Hayk Amirkhanian; Marco F. Huber

arXiv:2512.13111·cs.LG·December 16, 2025

TL;DR

This paper introduces the Hierarchical Approximate Bayesian Neural Network (HABNN), which places a Gaussian-inverse-Wishart hyperprior on the network's weights to improve robustness and uncertainty estimation, and reports superior performance especially on out-of-distribution data.

Contribution

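The paper proposes a novel hierarchical Bayesian neural network with a Gaussian-inverse-Wishart hyperprior on the weights. It derives the predictive distribution and the weight posterior in closed form: both reduce to computing the parameters of Student's t-distributions, at a cost linear in the number of weights.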

Findings

01. HABNN effectively mitigates overfitting.

02. It provides reliable uncertainty estimates for out-of-distribution data.

03. The model often outperforms state-of-the-art approaches.

Abstract

In recent years, neural networks have revolutionized various domains, yet challenges such as hyperparameter tuning and overfitting remain significant hurdles. Bayesian neural networks offer a framework to address these challenges by incorporating uncertainty directly into the model, yielding more reliable predictions, particularly for out-of-distribution data. This paper presents the Hierarchical Approximate Bayesian Neural Network, a novel approach that uses a Gaussian-inverse-Wishart distribution as a hyperprior on the network's weights to increase both the robustness and the performance of the model. We provide analytical representations for the predictive distribution and weight posterior, which amount to calculating the parameters of Student's t-distributions in closed form, with complexity linear in the number of weights. Our method demonstrates robust performance,…
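The link between a Gaussian-inverse-Wishart hyperprior and Student's t-distributions can be illustrated with a textbook conjugacy result. The sketch below is not the paper's algorithm; it covers only the one-dimensional case, where the inverse-Wishart reduces to an inverse-gamma, and it treats a single weight in isolation. All function names and hyperparameter values (`mu0`, `kappa0`, `alpha`, `beta`) are hypothetical choices made for illustration. It samples a weight through the hierarchy and compares the empirical variance with the variance of the Student's t marginal that the conjugate analysis predicts.

```python
import random
import statistics


def marginal_student_t_params(mu0, kappa0, alpha, beta):
    """Student's t marginal of w under a normal-inverse-gamma hyperprior.

    Hierarchy (assumed for illustration, scalar case only):
        sigma2        ~ Inverse-Gamma(alpha, beta)
        mu | sigma2   ~ Normal(mu0, sigma2 / kappa0)
        w | mu,sigma2 ~ Normal(mu, sigma2)
    Returns (degrees of freedom, location, scale).
    """
    dof = 2.0 * alpha
    scale = (beta * (kappa0 + 1.0) / (alpha * kappa0)) ** 0.5
    return dof, mu0, scale


def sample_weight(rng, mu0, kappa0, alpha, beta):
    """Draw one weight by ancestral sampling through the hierarchy."""
    # Inverse-gamma draw: reciprocal of a Gamma(shape=alpha, rate=beta) draw.
    # random.gammavariate takes a scale parameter, hence 1 / beta.
    sigma2 = 1.0 / rng.gammavariate(alpha, 1.0 / beta)
    mu = rng.gauss(mu0, (sigma2 / kappa0) ** 0.5)
    return rng.gauss(mu, sigma2 ** 0.5)


def compare_variances(n_samples=200_000, seed=0,
                      mu0=0.0, kappa0=2.0, alpha=5.0, beta=4.0):
    """Return (empirical mean, empirical variance, analytic variance)."""
    rng = random.Random(seed)
    draws = [sample_weight(rng, mu0, kappa0, alpha, beta)
             for _ in range(n_samples)]
    dof, _, scale = marginal_student_t_params(mu0, kappa0, alpha, beta)
    # Variance of a Student's t with this scale; finite only for dof > 2.
    analytic_var = scale ** 2 * dof / (dof - 2.0)
    return statistics.fmean(draws), statistics.pvariance(draws), analytic_var


if __name__ == "__main__":
    mean, empirical_var, analytic_var = compare_variances()
    print(f"empirical mean:     {mean:.4f}")
    print(f"empirical variance: {empirical_var:.4f}")
    print(f"analytic variance:  {analytic_var:.4f}")
```

Computing the marginal's parameters takes a fixed number of arithmetic operations per weight in this scalar setting, which is at least consistent with the linear complexity stated in the abstract. How the paper handles correlated weights and propagation through layers is not covered by this sketch.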

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference · Machine Learning and ELM