Information, Divergence and Risk for Binary Experiments

Mark D. Reid; Robert C. Williamson

arXiv:0901.0356·stat.ML·January 6, 2009·J. Mach. Learn. Res.·4 cites

Information, Divergence and Risk for Binary Experiments

Mark D. Reid, Robert C. Williamson

PDF

Open Access

TL;DR

This paper unifies various divergence measures and loss functions within a common framework related to binary classification, leading to new bounds, inequalities, and insights into existing algorithms.

Contribution

It introduces a systematic integral and variational approach that unifies divergences, loss bounds, and information measures, revealing new relationships and derivations in binary learning.

Findings

01

Derived tight surrogate loss bounds

02

Established generalized Pinsker inequalities

03

Provided new derivations of SVMs and links to MMD

Abstract

We unify f-divergences, Bregman divergences, surrogate loss bounds (regret bounds), proper scoring rules, matching losses, cost curves, ROC-curves and information. We do this by systematically studying integral and variational representations of these objects and in so doing identify their primitives which all are related to cost-sensitive binary classification. As well as clarifying relationships between generative and discriminative views of learning, the new machinery leads to tight and more general surrogate loss bounds and generalised Pinsker inequalities relating f-divergences to variational divergence. The new viewpoint illuminates existing algorithms: it provides a new derivation of Support Vector Machines in terms of divergences and relates Maximum Mean Discrepancy to Fisher Linear Discriminants. It also suggests new techniques for estimating f-divergences.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Methods and Models · Statistical Mechanics and Entropy · Bayesian Modeling and Causal Inference