Bayesian Neural Networks vs. Mixture Density Networks: Theoretical and Empirical Insights for Uncertainty-Aware Nonlinear Modeling

Riddhi Pratim Ghosh; Ian Barnett

arXiv:2510.25001·stat.CO·October 30, 2025

Bayesian Neural Networks vs. Mixture Density Networks: Theoretical and Empirical Insights for Uncertainty-Aware Nonlinear Modeling

Riddhi Pratim Ghosh, Ian Barnett

PDF

TL;DR

This paper compares Bayesian Neural Networks and Mixture Density Networks for uncertainty-aware nonlinear regression, analyzing their theoretical convergence and empirical performance on synthetic and real datasets.

Contribution

It provides a unified theoretical and empirical comparison highlighting the strengths and limitations of BNNs and MDNs in uncertainty modeling.

Findings

01

MDNs converge faster in KL divergence due to likelihood-based training

02

BNNs offer more interpretable epistemic uncertainty with limited data

03

MDNs better capture multimodal and heteroscedastic responses

Abstract

This paper investigates two prominent probabilistic neural modeling paradigms: Bayesian Neural Networks (BNNs) and Mixture Density Networks (MDNs) for uncertainty-aware nonlinear regression. While BNNs incorporate epistemic uncertainty by placing prior distributions over network parameters, MDNs directly model the conditional output distribution, thereby capturing multimodal and heteroscedastic data-generating mechanisms. We present a unified theoretical and empirical framework comparing these approaches. On the theoretical side, we derive convergence rates and error bounds under H\"older smoothness conditions, showing that MDNs achieve faster Kullback-Leibler (KL) divergence convergence due to their likelihood-based nature, whereas BNNs exhibit additional approximation bias induced by variational inference. Empirically, we evaluate both architectures on synthetic nonlinear datasets and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.