Random Sampling High Dimensional Model Representation Gaussian Process Regression (RS-HDMR-GPR) for representing multidimensional functions with machine-learned lower-dimensional terms allowing insight with a general method
Owen Ren, Mohamed Ali Boussaidi, Dmitry Voytsekhovsky, Manabu Ihara,, and Sergei Manzhos

TL;DR
This paper introduces RS-HDMR-GPR, a Python-based method combining high-dimensional model representation with Gaussian process regression to efficiently model complex multivariate functions, especially from sparse data, with added interpretability.
Contribution
The paper presents a novel Python implementation of RS-HDMR-GPR that enables low-dimensional representation, missing data imputation, and variable importance estimation for multidimensional functions.
Findings
Effective in recovering functional dependence from sparse data
Capable of imputing missing variable values
Provides insights into variable importance
Abstract
We present a Python implementation for RS-HDMR-GPR (Random Sampling High Dimensional Model Representation Gaussian Process Regression). The method builds representations of multivariate functions with lower-dimensional terms, either as an expansion over orders of coupling or using terms of only a given dimensionality. This facilitates, in particular, recovering functional dependence from sparse data. The code also allows for imputation of missing values of the variables and for a significant pruning of the useful number of HDMR terms. The code can also be used for estimating relative importance of different combinations of input variables, thereby adding an element of insight to a general machine learning method. The capabilities of this regression tool are demonstrated on test cases involving synthetic analytic functions, the potential energy surface of the water molecule, kinetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Mass Spectrometry Techniques and Applications · Machine Learning in Materials Science
MethodsPruning · Gaussian Process
