Distribution-Free Distribution Regression

Barnabas Poczos; Alessandro Rinaldo; Aarti Singh; Larry Wasserman

arXiv:1302.0082·stat.ML·February 4, 2013

Distribution-Free Distribution Regression

Barnabas Poczos, Alessandro Rinaldo, Aarti Singh, Larry Wasserman

PDF

TL;DR

This paper develops distribution-free methods for distribution regression, allowing for unknown error distributions and providing theoretical guarantees on prediction risk convergence under certain conditions.

Contribution

It introduces distribution-free distribution regression methods with proven risk convergence rates without assuming specific error or covariate distributions.

Findings

01

Risk converges to zero at a polynomial rate when the effective dimension is small.

02

Theoretical framework applies to models with unknown error and covariate distributions.

03

Provides conditions under which distribution-free distribution regression is effective.

Abstract

`Distribution regression' refers to the situation where a response Y depends on a covariate P where P is a probability distribution. The model is Y=f(P) + mu where f is an unknown regression function and mu is a random error. Typically, we do not observe P directly, but rather, we observe a sample from P. In this paper we develop theory and methods for distribution-free versions of distribution regression. This means that we do not make distributional assumptions about the error term mu and covariate P. We prove that when the effective dimension is small enough (as measured by the doubling dimension), then the excess prediction risk converges to zero with a polynomial rate.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.