Generalization bounds for learning under graph-dependence: A survey

Rui-Ray Zhang; Massih-Reza Amini

arXiv:2203.13534·cs.LG·April 9, 2024·1 cites

Generalization bounds for learning under graph-dependence: A survey

Rui-Ray Zhang, Massih-Reza Amini

PDF

Open Access

TL;DR

This survey reviews recent advances in understanding how dependence among data points, modeled by graphs, affects generalization bounds in statistical learning, highlighting new concentration bounds and their applications.

Contribution

It compiles and discusses graph-dependent concentration bounds and their use in deriving generalization bounds, providing a comprehensive overview of this emerging research area.

Findings

01

Collected various graph-dependent concentration bounds

02

Derived Rademacher complexity and stability bounds for dependent data

03

Outlined practical learning scenarios and future research directions

Abstract

Traditional statistical learning theory relies on the assumption that data are identically and independently distributed (i.i.d.). However, this assumption often does not hold in many real-life applications. In this survey, we explore learning scenarios where examples are dependent and their dependence relationship is described by a dependency graph, a commonly utilized model in probability and combinatorics. We collect various graph-dependent concentration bounds, which are then used to derive Rademacher complexity and stability generalization bounds for learning from graph-dependent data. We illustrate this paradigm through practical learning tasks and provide some research directions for future work. To our knowledge, this survey is the first of this kind on this subject.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Gene expression and cancer classification · Machine Learning and Data Classification