A Unified Information-Theoretic Framework for Meta-Learning Generalization
Wen Wen, Tieliang Gong, Yuxin Dong, Zeyu Gao, Yong-Jin Liu

TL;DR
This paper introduces a unified information-theoretic framework for meta-learning that provides tighter, more scalable bounds on generalization gaps, offering new theoretical insights and validated by numerical experiments.
Contribution
It develops a single-step derivation framework for meta-generalization bounds, improving upon previous two-step bounds with enhanced tightness and computational efficiency.
Findings
Unified bounds outperform previous results in tightness and scalability.
Gradient covariance analysis offers new insights into meta-learning algorithms.
Numerical results confirm the bounds effectively capture generalization dynamics.
Abstract
In recent years, information-theoretic generalization bounds have gained increasing attention for analyzing the generalization capabilities of meta-learning algorithms. However, existing results are confined to two-step bounds, failing to provide a sharper characterization of the meta-generalization gap that simultaneously accounts for environment-level and task-level dependencies. This paper addresses this fundamental limitation by developing a unified information-theoretic framework using a single-step derivation. The resulting meta-generalization bounds, expressed in terms of diverse information measures, exhibit substantial advantages over previous work, particularly in terms of tightness, scaling behavior associated with sampled tasks and samples per task, and computational tractability. Furthermore, through gradient covariance analysis, we provide new theoretical insights into the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
