Margin Maximization as Lossless Maximal Compression

Nikolaos Nikolaou; Henry Reeve; Gavin Brown

arXiv:2001.10318·cs.LG·January 29, 2020·1 cites

Margin Maximization as Lossless Maximal Compression

Nikolaos Nikolaou, Henry Reeve, Gavin Brown

PDF

Open Access 1 Repo

TL;DR

This paper interprets margin maximization in classification as achieving lossless maximal compression of training data, providing new insights into generalization and explaining the success of algorithms like gradient boosting.

Contribution

It offers an information-theoretic perspective on margin maximization, connecting it to lossless data compression and broadening understanding of generalization in supervised learning.

Findings

01

Margin maximization corresponds to lossless data compression.

02

The interpretation explains the success of gradient boosting.

03

The approach provides theoretical and empirical support for the connection.

Abstract

The ultimate goal of a supervised learning algorithm is to produce models constructed on the training data that can generalize well to new examples. In classification, functional margin maximization -- correctly classifying as many training examples as possible with maximal confidence --has been known to construct models with good generalization guarantees. This work gives an information-theoretic interpretation of a margin maximizing model on a noiseless training dataset as one that achieves lossless maximal compression of said dataset -- i.e. extracts from the features all the useful information for predicting the label and no more. The connection offers new insights on generalization in supervised machine learning, showing margin maximization as a special case (that of classification) of a more general principle and explains the success and potential limitations of popular learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nnikolaou/margin_maximization_LMC
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning