Guarantees for Nonlinear Representation Learning: Non-identical   Covariates, Dependent Data, Fewer Samples

Thomas T. Zhang; Bruce D. Lee; Ingvar Ziemann; George J. Pappas,; Nikolai Matni

arXiv:2410.11227·stat.ML·October 16, 2024

Guarantees for Nonlinear Representation Learning: Non-identical Covariates, Dependent Data, Fewer Samples

Thomas T. Zhang, Bruce D. Lee, Ingvar Ziemann, George J. Pappas,, Nikolai Matni

PDF

Open Access

TL;DR

This paper provides theoretical guarantees for learning nonlinear representations from multiple sources with non-identical distributions and dependencies, showing how task diversity and data quantity influence sample complexity and risk bounds.

Contribution

It introduces a framework for analyzing sample complexity and risk in nonlinear representation learning with dependent, non-i.i.d. data across multiple tasks.

Findings

01

Sample complexity depends on data dependency and task diversity.

02

Risk bounds improve with more tasks, approaching iid regression performance.

03

Dependency affects sample requirements but not the asymptotic risk bound.

Abstract

A driving force behind the diverse applicability of modern machine learning is the ability to extract meaningful features across many sources. However, many practical domains involve data that are non-identically distributed across sources, and statistically dependent within its source, violating vital assumptions in existing theoretical studies. Toward addressing these issues, we establish statistical guarantees for learning general $nonlinear$ representations from multiple data sources that admit different input distributions and possibly dependent data. Specifically, we study the sample-complexity of learning $T + 1$ functions $f_{⋆}^{(t)} \circ g_{⋆}$ from a function class $F \times G$ , where $f_{⋆}^{(t)}$ are task specific linear functions and $g_{⋆}$ is a shared nonlinear representation. A representation $\overset{g}{^}$ is estimated using $N$ samples from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications