Solving High Frequency and Multi-Scale PDEs with Gaussian Processes
Shikai Fang, Madison Cooley, Da Long, Shibo Li, Robert Kirby, Shandian, Zhe

TL;DR
This paper introduces a Gaussian process framework with spectral mixture kernels to effectively solve high-frequency and multi-scale PDEs, overcoming spectral bias issues faced by neural network-based methods like PINNs.
Contribution
The authors develop a scalable Gaussian process approach using spectral mixture kernels and Kronecker algebra to accurately solve complex PDEs with high frequencies and multiple scales.
Findings
Outperforms PINNs on high-frequency PDEs
Efficiently handles large collocation point sets
Automatically prunes unnecessary frequencies
Abstract
Machine learning based solvers have garnered much attention in physical simulation and scientific computing, with a prominent example, physics-informed neural networks (PINNs). However, PINNs often struggle to solve high-frequency and multi-scale PDEs, which can be due to spectral bias during neural network training. To address this problem, we resort to the Gaussian process (GP) framework. To flexibly capture the dominant frequencies, we model the power spectrum of the PDE solution with a student mixture or Gaussian mixture. We apply the inverse Fourier transform to obtain the covariance function (by Wiener-Khinchin theorem). The covariance derived from the Gaussian mixture spectrum corresponds to the known spectral mixture kernel. Next, we estimate the mixture weights in the log domain, which we show is equivalent to placing a Jeffreys prior. It automatically induces sparsity,…
Peer Reviews
Decision·ICLR 2024 poster
Overall, the paper presents a promising idea for how to solve certain classes of PDEs with Gaussian processes. The idea of using a sparsity-inducing prior over the frequencies of a spectral mixture kernel seems very useful and the paper demonstrates the robustness and applicability of the approach.
There are some weaknesses in the paper regarding the claimed novelty of the approach, the experimental evaluation and the proper attribution of existing approaches. ## Originality In my opinion the following two statements from the paper are incorrect or at least too strong for what is presented in the paper: 1. Quote from the Abstract (also mentioned in contributions): "The covariance derived from the Gaussian mixture spectrum corresponds to the known spectral mixture kernel. We are the first
The authors described their method in detail, analyzed the runtime complexity, and provided multiple sets of experiments.
The paper abstract right now is very hard to follow for interested readers, instead of focussing on what the authors did in their methodologies, it should focus mainly on the contributions from a high level. The introduction clarifies the authors’ motivation well, however certain rearrangement of figures can help make it better. When the authors say their method is focussed on high fidelity and multi-scale PDEs, they can exhibit this with a figure (perhaps by moving figure 2,3 up in the first 2
- The paper is well-structured and the problem is clearly introduced and motivated. - The authors perform several numerical experiments on synthetic datasets, which show that their method outperform PINNs in terms of resulting accuracy of the solution.
- One of the weaknesses of the paper is that the difference between prior GP-based works in the literature is unclear. The authors should clearly state the differences between their work and the existing ones, in particular the papers by Chen et al (JCP, 2021) and Raissi et al (JCP, 2017), cited in this work. I believe that the two differences are: (1) The use of student-t distribution and (2) using a Kronecker product grid to speed-up computations. In the current version, the title and abstract
Code & Models
Videos
Taxonomy
TopicsScientific Research and Discoveries
MethodsGaussian Process
