Ensembling over Classifiers: a Bias-Variance Perspective
Neha Gupta, Jamie Smith, Ben Adlam, Zelda Mariet

TL;DR
This paper offers a bias-variance perspective on classifier ensembles, revealing that ensembling reduces variance and can unexpectedly reduce bias, especially in deep learning hyperparameter ensembles, challenging classical views.
Contribution
It introduces a dual reparameterization of the bias-variance tradeoff for classification, providing new insights into how ensembles affect bias and variance.
Findings
Ensembling reduces variance and can reduce bias in classifiers.
Conditional bias/variance estimates necessarily include irreducible error.
Deep learning ensembles over hyperparameters tend to reduce bias.
Abstract
Ensembles are a straightforward, remarkably effective method for improving the accuracy,calibration, and robustness of models on classification tasks; yet, the reasons that underlie their success remain an active area of research. We build upon the extension to the bias-variance decomposition by Pfau (2013) in order to gain crucial insights into the behavior of ensembles of classifiers. Introducing a dual reparameterization of the bias-variance tradeoff, we first derive generalized laws of total expectation and variance for nonsymmetric losses typical of classification tasks. Comparing conditional and bootstrap bias/variance estimates, we then show that conditional estimates necessarily incur an irreducible error. Next, we show that ensembling in dual space reduces the variance and leaves the bias unchanged, whereas standard ensembling can arbitrarily affect the bias. Empirically,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Gaussian Processes and Bayesian Inference
