A clustered Gaussian process model for computer experiments

Chih-Li Sung; Benjamin Haaland; Youngdeok Hwang; Siyuan Lu

arXiv:1911.04602·stat.ME·November 6, 2020

A clustered Gaussian process model for computer experiments

Chih-Li Sung, Benjamin Haaland, Youngdeok Hwang, Siyuan Lu

PDF

Open Access

TL;DR

This paper introduces a clustered Gaussian process model that segments data into clusters for improved emulation of computer experiments, addressing stationarity and scalability issues, and demonstrates its effectiveness through simulations and real data.

Contribution

The paper proposes a novel clustered Gaussian process model with an efficient EM algorithm, enhancing scalability and interpretability for computer experiment emulation.

Findings

01

Smaller mean square errors compared to competitors

02

Competitive computation time

03

Provides insights through cluster discovery

Abstract

A Gaussian process has been one of the important approaches for emulating computer simulations. However, the stationarity assumption for a Gaussian process and the intractability for large-scale dataset limit its availability in practice. In this article, we propose a clustered Gaussian process model which segments the input data into multiple clusters, in each of which a Gaussian process model is performed. The stochastic expectation-maximization is employed to efficiently fit the model. In our simulations as well as a real application to solar irradiance emulation, our proposed method had smaller mean square errors than its main competitors, with competitive computation time, and provides valuable insights from data by discovering the clusters. An R package for the proposed methodology is provided in an open repository.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Multi-Objective Optimization Algorithms · Gaussian Processes and Bayesian Inference · Solar Radiation and Photovoltaics