Improved learning theory for kernel distribution regression with two-stage sampling

Fran\c{c}ois Bachoc; Louis B\'ethune; Alberto Gonz\'alez-Sanz; Jean-Michel Loubes

arXiv:2308.14335·math.ST·September 22, 2025

Improved learning theory for kernel distribution regression with two-stage sampling

Fran\c{c}ois Bachoc, Louis B\'ethune, Alberto Gonz\'alez-Sanz, Jean-Michel Loubes

PDF

Open Access 1 Repo

TL;DR

This paper advances the theoretical understanding of kernel distribution regression with two-stage sampling, providing improved error bounds and convergence rates for a broad class of kernels based on Hilbertian embeddings.

Contribution

It introduces a novel near-unbiased condition for Hilbertian embeddings, leading to tighter error bounds and improved convergence rates in kernel distribution regression.

Findings

01

The near-unbiased condition holds for kernels based on optimal transport and mean embedding.

02

New error bounds are established for the effect of two-stage sampling.

03

Convergence rates are strictly improved for the considered kernels.

Abstract

The distribution regression problem encompasses many important statistics and machine learning tasks, and arises in a large range of applications. Among various existing approaches to tackle this problem, kernel methods have become a method of choice. Indeed, kernel distribution regression is both computationally favorable, and supported by a recent learning theory. This theory also tackles the two-stage sampling setting, where only samples from the input distributions are available. In this paper, we improve the learning theory of kernel distribution regression. We address kernels based on Hilbertian embeddings, that encompass most, if not all, of the existing approaches. We introduce the novel near-unbiased condition on the Hilbertian embeddings, that enables us to provide new error bounds on the effect of the two-stage sampling, thanks to a new analysis. We show that this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

algue-rythme/distributionregressionus2016
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMicroRNA in disease regulation · Statistical Methods and Inference · Machine Learning and ELM