Improving the performance of Stein variational inference through extreme sparsification of physically-constrained neural network models
Govinda Anantha Padmanabha, Jan Niklas Fuhg, Cosmin Safta, Reese E., Jones, Nikolaos Bouklas

TL;DR
This paper introduces an $L_0$ sparsification prior combined with Stein variational gradient descent to improve uncertainty quantification in neural network models for scientific machine learning, reducing computational costs and enhancing robustness.
Contribution
The novel integration of $L_0$ sparsification with SVGD offers a more robust and efficient approach for uncertainty quantification in high-dimensional neural network models.
Findings
$L_0$+SVGD outperforms standard SVGD in noise resilience.
$L_0$+SVGD achieves faster convergence to optimal solutions.
$L_0$+SVGD performs well in extrapolated regions.
Abstract
Most scientific machine learning (SciML) applications of neural networks involve hundreds to thousands of parameters, and hence, uncertainty quantification for such models is plagued by the curse of dimensionality. Using physical applications, we show that sparsification prior to Stein variational gradient descent (+SVGD) is a more robust and efficient means of uncertainty quantification, in terms of computational cost and performance than the direct application of SGVD or projected SGVD methods. Specifically, +SVGD demonstrates superior resilience to noise, the ability to perform well in extrapolated regions, and a faster convergence rate to an optimal solution.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning · Seismic Imaging and Inversion Techniques
