Multi-variance replica exchange stochastic gradient MCMC for inverse and forward Bayesian physics-informed neural network
Guang Lin, Yating Wang, Zecheng Zhang

TL;DR
This paper introduces a multi-variance replica exchange stochastic gradient Langevin diffusion method to improve Bayesian physics-informed neural network training, effectively addressing local optima and multimodal posteriors with enhanced efficiency and accuracy.
Contribution
It proposes a novel multi-variance replica exchange scheme with different energy function assumptions, enabling efficient Bayesian PINN training for complex inverse and forward problems.
Findings
Faster convergence compared to SGLD and vanilla replica exchange methods.
Successfully solved inverse problems with multiple modes.
Achieved more accurate results in Bayesian PINN training.
Abstract
Physics-informed neural network (PINN) has been successfully applied in solving a variety of nonlinear non-convex forward and inverse problems. However, the training is challenging because of the non-convex loss functions and the multiple optima in the Bayesian inverse problem. In this work, we propose a multi-variance replica exchange stochastic gradient Langevin diffusion method to tackle the challenge of the multiple local optima in the optimization and the challenge of the multiple modal posterior distribution in the inverse problem. Replica exchange methods are capable of escaping from the local traps and accelerating the convergence. However, it may not be efficient to solve mathematical inversion problems by using the vanilla replica method directly since the method doubles the computational cost in evaluating the forward solvers (likelihood functions) in the two chains. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference
