Robust Hypothesis Test for Nonlinear Effect with Gaussian Processes
Jeremiah Zhe Liu, Brent Coull

TL;DR
This paper introduces a robust hypothesis testing method for nonlinear effects using Gaussian processes, capable of handling kernel mis-specification and small sample sizes, with applications in detecting feature interactions.
Contribution
It develops a kernel-robust, ensemble-based hypothesis test for nonlinear effects, bridging machine learning and statistical inference insights.
Findings
Test performs well with small samples.
Kernel mis-specification impacts test accuracy.
Estimating hyperparameters via MLE affects inference.
Abstract
This work constructs a hypothesis test for detecting whether an data-generating function belongs to a specific reproducing kernel Hilbert space , where the structure of is only partially known. Utilizing the theory of reproducing kernels, we reduce this hypothesis to a simple one-sided score test for a scalar parameter, develop a testing procedure that is robust against the mis-specification of kernel functions, and also propose an ensemble-based estimator for the null model to guarantee test performance in small samples. To demonstrate the utility of the proposed method, we apply our test to the problem of detecting nonlinear interaction between groups of continuous features. We evaluate the finite-sample performance of our test under different data-generating functions and estimation strategies for the null model. Our results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Statistical Methods and Inference · Advanced Statistical Methods and Models
