Accelerating Hamiltonian Monte Carlo for Bayesian Inference in Neural Networks and Neural Operators

Ponkrshnan Thiagarajan; Tamer A. Zaki; Michael D. Shields

arXiv:2507.14652 · stat.ML · September 11, 2025

Open Access

TL;DR

This paper introduces a hybrid approach combining variational inference and Hamiltonian Monte Carlo to efficiently and accurately quantify uncertainties in neural networks, significantly reducing computational costs for large models.

Contribution

The paper presents a hybrid method that accelerates HMC: after an initial VI training, it identifies the influential parameters and focuses HMC on them, improving uncertainty quantification in neural networks.

Findings

1. Efficient inference on networks with tens to hundreds of thousands of parameters.
2. Accurate uncertainty quantification for complex physical system models.
3. Significant reduction in HMC computational cost without sacrificing accuracy.

Abstract

Hamiltonian Monte Carlo (HMC) is a powerful and accurate method to sample from the posterior distribution in Bayesian inference. However, HMC techniques are computationally demanding for Bayesian neural networks due to the high dimensionality of the network's parameter space and the non-convexity of their posterior distributions. Therefore, various approximation techniques, such as variational inference (VI) or stochastic gradient MCMC, are often employed to infer the posterior distribution of the network parameters. Such approximations introduce inaccuracies in the inferred distributions, resulting in unreliable uncertainty estimates. In this work, we propose a hybrid approach that combines inexpensive VI and accurate HMC methods to efficiently and accurately quantify uncertainties in neural networks and neural operators. The proposed approach leverages an initial VI training on the…
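The general idea of running HMC over only a subset of parameters, with the rest held at their VI estimates, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract is cut off before it says how the parameters are selected, so the rule used here (largest VI standard deviation) is a hypothetical stand-in, and the names `hmc_subset`, `theta_vi`, `vi_std`, and the toy Gaussian target are invented for the example.

```python
import numpy as np

def leapfrog(q, p, grad_u, step, n_steps):
    """Standard leapfrog integrator for Hamiltonian dynamics."""
    q, p = q.copy(), p.copy()
    p -= 0.5 * step * grad_u(q)           # initial half step in momentum
    for i in range(n_steps):
        q += step * p                     # full step in position
        if i < n_steps - 1:
            p -= step * grad_u(q)         # full step in momentum
    p -= 0.5 * step * grad_u(q)           # final half step in momentum
    return q, p

def hmc_subset(u, grad_u, theta_vi, active, n_samples,
               step=0.1, n_steps=20, seed=0):
    """HMC over theta[active]; all other entries stay fixed at theta_vi.

    u is the negative log posterior and grad_u its gradient, both taking
    the full parameter vector.
    """
    rng = np.random.default_rng(seed)

    def embed(q):
        full = theta_vi.copy()
        full[active] = q
        return full

    u_sub = lambda q: u(embed(q))
    grad_sub = lambda q: grad_u(embed(q))[active]

    q = theta_vi[active].copy()           # start the chain at the VI mean
    samples = np.empty((n_samples, theta_vi.size))
    for s in range(n_samples):
        p0 = rng.standard_normal(q.size)  # resample momentum
        q_new, p_new = leapfrog(q, p0, grad_sub, step, n_steps)
        h_old = u_sub(q) + 0.5 * p0 @ p0
        h_new = u_sub(q_new) + 0.5 * p_new @ p_new
        if np.log(rng.uniform()) < h_old - h_new:   # Metropolis correction
            q = q_new
        samples[s] = embed(q)
    return samples

# Toy stand-in for a trained VI posterior: mean and per-parameter std.
theta_vi = np.zeros(5)
vi_std = np.array([0.01, 1.0, 0.02, 2.0, 0.01])

# Hypothetical selection rule (an assumption, not from the paper):
# keep the k parameters with the largest VI standard deviation.
k = 2
active = np.sort(np.argsort(vi_std)[-k:])

# Toy negative log posterior: independent zero-mean Gaussians.
u = lambda th: 0.5 * np.sum(th**2 / vi_std**2)
grad_u = lambda th: th / vi_std**2

samples = hmc_subset(u, grad_u, theta_vi, active, n_samples=200)
```

Each row of `samples` is a full parameter vector in which only the `active` entries vary, so the cost of each leapfrog trajectory scales with the size of the subset rather than with the full network.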

Taxonomy

Topics: Model Reduction and Neural Networks · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference