Bayesian Neural Networks vs. Mixture Density Networks: Theoretical and Empirical Insights for Uncertainty-Aware Nonlinear Modeling
Riddhi Pratim Ghosh, Ian Barnett

TL;DR
This paper compares Bayesian Neural Networks and Mixture Density Networks for uncertainty-aware nonlinear regression, analyzing their theoretical convergence and empirical performance on synthetic and real datasets.
Contribution
It provides a unified theoretical and empirical comparison highlighting the strengths and limitations of BNNs and MDNs in uncertainty modeling.
Findings
MDNs converge faster in KL divergence due to likelihood-based training
BNNs offer more interpretable epistemic uncertainty with limited data
MDNs better capture multimodal and heteroscedastic responses
Abstract
This paper investigates two prominent probabilistic neural modeling paradigms: Bayesian Neural Networks (BNNs) and Mixture Density Networks (MDNs) for uncertainty-aware nonlinear regression. While BNNs incorporate epistemic uncertainty by placing prior distributions over network parameters, MDNs directly model the conditional output distribution, thereby capturing multimodal and heteroscedastic data-generating mechanisms. We present a unified theoretical and empirical framework comparing these approaches. On the theoretical side, we derive convergence rates and error bounds under H\"older smoothness conditions, showing that MDNs achieve faster Kullback-Leibler (KL) divergence convergence due to their likelihood-based nature, whereas BNNs exhibit additional approximation bias induced by variational inference. Empirically, we evaluate both architectures on synthetic nonlinear datasets and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
