Improving the performance of Stein variational inference through extreme   sparsification of physically-constrained neural network models

Govinda Anantha Padmanabha; Jan Niklas Fuhg; Cosmin Safta; Reese E.; Jones; Nikolaos Bouklas

arXiv:2407.00761·cs.LG·July 2, 2024·2 cites

Improving the performance of Stein variational inference through extreme sparsification of physically-constrained neural network models

Govinda Anantha Padmanabha, Jan Niklas Fuhg, Cosmin Safta, Reese E., Jones, Nikolaos Bouklas

PDF

Open Access

TL;DR

This paper introduces an $L_0$ sparsification prior combined with Stein variational gradient descent to improve uncertainty quantification in neural network models for scientific machine learning, reducing computational costs and enhancing robustness.

Contribution

The novel integration of $L_0$ sparsification with SVGD offers a more robust and efficient approach for uncertainty quantification in high-dimensional neural network models.

Findings

01

$L_0$+SVGD outperforms standard SVGD in noise resilience.

02

$L_0$+SVGD achieves faster convergence to optimal solutions.

03

$L_0$+SVGD performs well in extrapolated regions.

Abstract

Most scientific machine learning (SciML) applications of neural networks involve hundreds to thousands of parameters, and hence, uncertainty quantification for such models is plagued by the curse of dimensionality. Using physical applications, we show that $L_{0}$ sparsification prior to Stein variational gradient descent ( $L_{0}$ +SVGD) is a more robust and efficient means of uncertainty quantification, in terms of computational cost and performance than the direct application of SGVD or projected SGVD methods. Specifically, $L_{0}$ +SVGD demonstrates superior resilience to noise, the ability to perform well in extrapolated regions, and a faster convergence rate to an optimal solution.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning · Seismic Imaging and Inversion Techniques