Rethinking Generalisation
Antonia Marcu, Adam Pr\"ugel-Bennett

TL;DR
This paper introduces a novel method for estimating generalisation performance based on the known distribution of risks, emphasizing the importance of the distribution's behavior near its minimum, and applies it to Boolean functions and perceptrons.
Contribution
It proposes a new approach to compute expected error using risk distribution and introduces the concept of attunement, with detailed analysis for Boolean functions and perceptrons.
Findings
Risk distribution's power-law behavior influences generalisation
Simplified and corrected models show different error predictions
Perceptron and Boolean functions analyzed for risk distribution
Abstract
In this paper, a new approach to computing the generalisation performance is presented that assumes the distribution of risks, , for a learning scenario is known. From this, the expected error of a learning machine using empirical risk minimisation is computed for both classification and regression problems. A critical quantity in determining the generalisation performance is the power-law behaviour of around its minimum value---a quantity we call attunement. The distribution is computed for the case of all Boolean functions and for the perceptron used in two different problem settings. Initially a simplified analysis is presented where an independence assumption about the losses is made. A more accurate analysis is carried out taking into account chance correlations in the training set. This leads to corrections in the typical behaviour that is observed.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
