Solving High Frequency and Multi-Scale PDEs with Gaussian Processes

Shikai Fang; Madison Cooley; Da Long; Shibo Li; Robert Kirby; Shandian; Zhe

arXiv:2311.04465·cs.LG·March 20, 2024·1 cites

Solving High Frequency and Multi-Scale PDEs with Gaussian Processes

Shikai Fang, Madison Cooley, Da Long, Shibo Li, Robert Kirby, Shandian, Zhe

PDF

Open Access 1 Repo 1 Video 3 Reviews

TL;DR

This paper introduces a Gaussian process framework with spectral mixture kernels to effectively solve high-frequency and multi-scale PDEs, overcoming spectral bias issues faced by neural network-based methods like PINNs.

Contribution

The authors develop a scalable Gaussian process approach using spectral mixture kernels and Kronecker algebra to accurately solve complex PDEs with high frequencies and multiple scales.

Findings

01

Outperforms PINNs on high-frequency PDEs

02

Efficiently handles large collocation point sets

03

Automatically prunes unnecessary frequencies

Abstract

Machine learning based solvers have garnered much attention in physical simulation and scientific computing, with a prominent example, physics-informed neural networks (PINNs). However, PINNs often struggle to solve high-frequency and multi-scale PDEs, which can be due to spectral bias during neural network training. To address this problem, we resort to the Gaussian process (GP) framework. To flexibly capture the dominant frequencies, we model the power spectrum of the PDE solution with a student $t$ mixture or Gaussian mixture. We apply the inverse Fourier transform to obtain the covariance function (by Wiener-Khinchin theorem). The covariance derived from the Gaussian mixture spectrum corresponds to the known spectral mixture kernel. Next, we estimate the mixture weights in the log domain, which we show is equivalent to placing a Jeffreys prior. It automatically induces sparsity,…

Peer Reviews

Decision·ICLR 2024 poster

Reviewer 01Rating 6· marginally above the acceptance thresholdConfidence 4

Strengths

Overall, the paper presents a promising idea for how to solve certain classes of PDEs with Gaussian processes. The idea of using a sparsity-inducing prior over the frequencies of a spectral mixture kernel seems very useful and the paper demonstrates the robustness and applicability of the approach.

Weaknesses

There are some weaknesses in the paper regarding the claimed novelty of the approach, the experimental evaluation and the proper attribution of existing approaches. ## Originality In my opinion the following two statements from the paper are incorrect or at least too strong for what is presented in the paper: 1. Quote from the Abstract (also mentioned in contributions): "The covariance derived from the Gaussian mixture spectrum corresponds to the known spectral mixture kernel. We are the ﬁrst

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 2

Strengths

The authors described their method in detail, analyzed the runtime complexity, and provided multiple sets of experiments.

Weaknesses

The paper abstract right now is very hard to follow for interested readers, instead of focussing on what the authors did in their methodologies, it should focus mainly on the contributions from a high level. The introduction clarifies the authors’ motivation well, however certain rearrangement of figures can help make it better. When the authors say their method is focussed on high fidelity and multi-scale PDEs, they can exhibit this with a figure (perhaps by moving figure 2,3 up in the first 2

Reviewer 03Rating 5· marginally below the acceptance thresholdConfidence 3

Strengths

- The paper is well-structured and the problem is clearly introduced and motivated. - The authors perform several numerical experiments on synthetic datasets, which show that their method outperform PINNs in terms of resulting accuracy of the solution.

Weaknesses

- One of the weaknesses of the paper is that the difference between prior GP-based works in the literature is unclear. The authors should clearly state the differences between their work and the existing ones, in particular the papers by Chen et al (JCP, 2021) and Raissi et al (JCP, 2017), cited in this work. I believe that the two differences are: (1) The use of student-t distribution and (2) using a Kronecker product grid to speed-up computations. In the current version, the title and abstract

Code & Models

Repositories

xuangu-fang/gaussian-process-slover-for-high-freq-pde
jaxOfficial

Videos

Solving High Frequency and Multi-Scale PDEs with Gaussian Processes· slideslive

Taxonomy

TopicsScientific Research and Discoveries

MethodsGaussian Process