A Unified Theory of Diversity in Ensemble Learning
Danny Wood, Tingting Mu, Andrew Webb, Henry Reeve, Mikel, Luj\'an, Gavin Brown

TL;DR
This paper develops a comprehensive theory of ensemble diversity, linking it to bias-variance decomposition, and clarifies its role as a measure of model fit rather than simply maximizing diversity.
Contribution
It introduces a unified framework connecting diversity with bias and variance, providing exact decompositions for various loss functions and an alternative approach for non-additive losses.
Findings
Diversity is a hidden dimension in the bias-variance decomposition.
Exact bias-variance-diversity decompositions are derived for multiple loss functions.
Diversity should be managed as part of a bias-variance-diversity trade-off.
Abstract
We present a theory of ensemble diversity, explaining the nature of diversity for a wide range of supervised learning scenarios. This challenge has been referred to as the holy grail of ensemble learning, an open research issue for over 30 years. Our framework reveals that diversity is in fact a hidden dimension in the bias-variance decomposition of the ensemble loss. We prove a family of exact bias-variance-diversity decompositions, for a wide range of losses in both regression and classification, e.g., squared, cross-entropy, and Poisson losses. For losses where an additive bias-variance decomposition is not available (e.g., 0/1 loss) we present an alternative approach: quantifying the effects of diversity, which turn out to be dependent on the label distribution. Overall, we argue that diversity is a measure of model fit, in precisely the same sense as bias and variance, but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms · Neural Networks and Applications
