Rethinking Generalisation

Antonia Marcu; Adam Pr\"ugel-Bennett

arXiv:1911.04301·cs.LG·March 27, 2020

Rethinking Generalisation

Antonia Marcu, Adam Pr\"ugel-Bennett

PDF

Open Access

TL;DR

This paper introduces a novel method for estimating generalisation performance based on the known distribution of risks, emphasizing the importance of the distribution's behavior near its minimum, and applies it to Boolean functions and perceptrons.

Contribution

It proposes a new approach to compute expected error using risk distribution and introduces the concept of attunement, with detailed analysis for Boolean functions and perceptrons.

Findings

01

Risk distribution's power-law behavior influences generalisation

02

Simplified and corrected models show different error predictions

03

Perceptron and Boolean functions analyzed for risk distribution

Abstract

In this paper, a new approach to computing the generalisation performance is presented that assumes the distribution of risks, $ρ (r)$ , for a learning scenario is known. From this, the expected error of a learning machine using empirical risk minimisation is computed for both classification and regression problems. A critical quantity in determining the generalisation performance is the power-law behaviour of $ρ (r)$ around its minimum value---a quantity we call attunement. The distribution $ρ (r)$ is computed for the case of all Boolean functions and for the perceptron used in two different problem settings. Initially a simplified analysis is presented where an independence assumption about the losses is made. A more accurate analysis is carried out taking into account chance correlations in the training set. This leads to corrections in the typical behaviour that is observed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning