Machine-Learning-Assisted Comparison of Regression Functions

Jian Yan; Zhuoxi Li; Yang Ning; Yong Chen

arXiv:2510.24714·stat.ME·October 29, 2025

Machine-Learning-Assisted Comparison of Regression Functions

Jian Yan, Zhuoxi Li, Yang Ning, Yong Chen

PDF

TL;DR

This paper introduces a new kernel-based framework for comparing regression functions that leverages machine learning for flexible estimation, overcoming limitations of traditional smoothing methods especially in high-dimensional settings.

Contribution

It proposes a generalized dependence measure and two novel tests for regression equality, with proven asymptotic properties under broad conditions.

Findings

01

Effective in high-dimensional regimes

02

No restrictive distributional assumptions needed

03

Demonstrated superior performance in numerical studies

Abstract

We revisit the classical problem of comparing regression functions, a fundamental question in statistical inference with broad relevance to modern applications such as data integration, transfer learning, and causal inference. Existing approaches typically rely on smoothing techniques and are thus hindered by the curse of dimensionality. We propose a generalized notion of kernel-based conditional mean dependence that provides a new characterization of the null hypothesis of equal regression functions. Building on this reformulation, we develop two novel tests that leverage modern machine learning methods for flexible estimation. We establish the asymptotic properties of the test statistics, which hold under both fixed- and high-dimensional regimes. Unlike existing methods that often require restrictive distributional assumptions, our framework only imposes mild moment conditions. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.