Bias-Variance Decompositions for Margin Losses

Danny Wood; Tingting Mu; Gavin Brown

arXiv:2204.12155·stat.ML·April 27, 2022

Bias-Variance Decompositions for Margin Losses

Danny Wood, Tingting Mu, Gavin Brown

PDF

Open Access

TL;DR

This paper develops a new bias-variance decomposition for strictly convex margin losses, offering insights into model behavior and ensemble diversity, with practical diagnostic implications for overfitting and underfitting.

Contribution

It introduces a novel bias-variance decomposition applicable to a range of convex margin losses, enhancing understanding of model risk and diversity.

Findings

01

Decomposition applies to logistic, squared, and canonical boosting losses.

02

Provides a diagnostic tool for overfitting and underfitting analysis.

03

Establishes a link between bias-variance and ensemble diversity.

Abstract

We introduce a novel bias-variance decomposition for a range of strictly convex margin losses, including the logistic loss (minimized by the classic LogitBoost algorithm), as well as the squared margin loss and canonical boosting loss. Furthermore, we show that, for all strictly convex margin losses, the expected risk decomposes into the risk of a "central" model and a term quantifying variation in the functional margin with respect to variations in the training data. These decompositions provide a diagnostic tool for practitioners to understand model overfitting/underfitting, and have implications for additive ensemble models -- for example, when our bias-variance decomposition holds, there is a corresponding "ambiguity" decomposition, which can be used to quantify model diversity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Health Systems, Economic Evaluations, Quality of Life · Advanced Causal Inference Techniques