How many classifiers do we need?

Hyunsuk Kim; Liam Hodgkinson; Ryan Theisen; Michael W. Mahoney

arXiv:2411.00328·stat.ML·November 4, 2024

How many classifiers do we need?

Hyunsuk Kim, Liam Hodgkinson, Ryan Theisen, Michael W. Mahoney

PDF

Open Access 1 Video

TL;DR

This paper analyzes how classifier disagreement and polarization influence ensemble performance, introducing a neural polarization law and providing bounds and asymptotic behaviors to guide ensemble size decisions.

Contribution

It introduces the concept of polarization in classifier ensembles, derives bounds on polarization and disagreement, and links these to ensemble performance and size predictions.

Findings

01

Polarization is nearly constant across hyperparameters and architectures.

02

Disagreement correlates linearly with the target under entropy conditions.

03

Asymptotic analysis predicts ensemble performance from smaller ensembles.

Abstract

As performance gains through scaling data and/or model size experience diminishing returns, it is becoming increasingly popular to turn to ensembling, where the predictions of multiple models are combined to improve accuracy. In this paper, we provide a detailed analysis of how the disagreement and the polarization (a notion we introduce and define in this paper) among classifiers relate to the performance gain achieved by aggregating individual classifiers, for majority vote strategies in classification tasks. We address these questions in the following ways. (1) An upper bound for polarization is derived, and we propose what we call a neural polarization law: most interpolating neural network models are 4/3-polarized. Our empirical results not only support this conjecture but also show that polarization is nearly constant for a dataset, regardless of hyperparameters or architectures…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

How many classifiers do we need?· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Opinion Dynamics and Social Influence · Sentiment Analysis and Opinion Mining