Learning Algorithm Generalization Error Bounds via Auxiliary Distributions
Gholamali Aminian, Saeed Masiha, Laura Toni, Miguel R. D. Rodrigues

TL;DR
This paper introduces the Auxiliary Distribution Method to derive new, tighter upper bounds on the generalization error in supervised learning, utilizing $oldsymbol{ extit{ extalpha}}$-Jensen-Shannon and Rényi information measures.
Contribution
The work presents a novel auxiliary distribution approach that yields new upper bounds on generalization errors, including bounds involving $ extalpha$-Jensen-Shannon and Rényi divergences, applicable under distribution mismatch scenarios.
Findings
Bounds are specialized for $ extalpha$-Jensen-Shannon and Rényi information.
Bounds are finite and can be tighter than previous bounds.
Method applies to excess risk and distribution mismatch scenarios.
Abstract
Generalization error bounds are essential for comprehending how well machine learning models work. In this work, we suggest a novel method, i.e., the Auxiliary Distribution Method, that leads to new upper bounds on expected generalization errors that are appropriate for supervised learning scenarios. We show that our general upper bounds can be specialized under some conditions to new bounds involving the -Jensen-Shannon, -R\'enyi () information between a random variable modeling the set of training samples and another random variable modeling the set of hypotheses. Our upper bounds based on -Jensen-Shannon information are also finite. Additionally, we demonstrate how our auxiliary distribution method can be used to derive the upper bounds on excess risk of some learning algorithms in the supervised learning context {\blue and the generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Face and Expression Recognition
MethodsTest
