Multi-variance replica exchange stochastic gradient MCMC for inverse and   forward Bayesian physics-informed neural network

Guang Lin; Yating Wang; Zecheng Zhang

arXiv:2107.06330·math.NA·July 15, 2021·6 cites

Multi-variance replica exchange stochastic gradient MCMC for inverse and forward Bayesian physics-informed neural network

Guang Lin, Yating Wang, Zecheng Zhang

PDF

Open Access

TL;DR

This paper introduces a multi-variance replica exchange stochastic gradient Langevin diffusion method to improve Bayesian physics-informed neural network training, effectively addressing local optima and multimodal posteriors with enhanced efficiency and accuracy.

Contribution

It proposes a novel multi-variance replica exchange scheme with different energy function assumptions, enabling efficient Bayesian PINN training for complex inverse and forward problems.

Findings

01

Faster convergence compared to SGLD and vanilla replica exchange methods.

02

Successfully solved inverse problems with multiple modes.

03

Achieved more accurate results in Bayesian PINN training.

Abstract

Physics-informed neural network (PINN) has been successfully applied in solving a variety of nonlinear non-convex forward and inverse problems. However, the training is challenging because of the non-convex loss functions and the multiple optima in the Bayesian inverse problem. In this work, we propose a multi-variance replica exchange stochastic gradient Langevin diffusion method to tackle the challenge of the multiple local optima in the optimization and the challenge of the multiple modal posterior distribution in the inverse problem. Replica exchange methods are capable of escaping from the local traps and accelerating the convergence. However, it may not be efficient to solve mathematical inversion problems by using the vanilla replica method directly since the method doubles the computational cost in evaluating the forward solvers (likelihood functions) in the two chains. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference