Ensemble Validation: Selectivity has a Price, but Variety is Free

Eric Bax; Farshad Kooti

arXiv:1610.01234·stat.ML·April 1, 2019

Ensemble Validation: Selectivity has a Price, but Variety is Free

Eric Bax, Farshad Kooti

PDF

TL;DR

This paper analyzes the trade-offs in ensemble classifier selection, showing that increasing variety in hypotheses does not incur additional error penalties if the same fraction is used, balancing selectivity and diversity.

Contribution

It introduces an error bound framework for ensembles that balances selectivity and hypothesis set richness without penalty.

Findings

01

Ensemble error bounds depend on selectivity and hypothesis set size.

02

Using more classifiers in the ensemble does not increase error if the same fraction is selected.

03

Richer hypothesis sets can be used freely without penalty.

Abstract

Suppose some classifiers are selected from a set of hypothesis classifiers to form an equally-weighted ensemble that selects a member classifier at random for each input example. Then the ensemble has an error bound consisting of the average error bound for the member classifiers, a term for selectivity that varies from zero (if all hypothesis classifiers are selected) to a standard uniform error bound (if only a single classifier is selected), and small constants. There is no penalty for using a richer hypothesis set if the same fraction of the hypothesis classifiers are selected for the ensemble.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.