Rethinking Fano's Inequality in Ensemble Learning

Terufumi Morishita; Gaku Morio; Shota Horiguchi; Hiroaki Ozaki; Nobuo; Nukaga

arXiv:2205.12683·cs.LG·November 17, 2023

Rethinking Fano's Inequality in Ensemble Learning

Terufumi Morishita, Gaku Morio, Shota Horiguchi, Hiroaki Ozaki, Nobuo, Nukaga

PDF

Open Access 1 Repo

TL;DR

This paper reexamines Fano's inequality in ensemble learning, introducing the concept of combination loss to better understand factors influencing ensemble system performance.

Contribution

It generalizes existing theory by incorporating information loss during prediction combination, providing a more accurate framework for ensemble system analysis.

Findings

01

Theoretical framework now accounts for combination loss.

02

Empirical validation demonstrates the theory's effectiveness.

03

Insights into system strengths and weaknesses on key metrics.

Abstract

We propose a fundamental theory on ensemble learning that answers the central question: what factors make an ensemble system good or bad? Previous studies used a variant of Fano's inequality of information theory and derived a lower bound of the classification error rate on the basis of the $accuracy$ and $diversity$ of models. We revisit the original Fano's inequality and argue that the studies did not take into account the information lost when multiple model predictions are combined into a final prediction. To address this issue, we generalize the previous theory to incorporate the information loss, which we name $combination loss$ . Further, we empirically validate and demonstrate the proposed theory through extensive experiments on actual systems. The theory reveals the strengths and weaknesses of systems on each metric, which will push the theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hitachi-nlp/ensemble-metrics
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Network Analysis Techniques · Neural Networks and Applications · Data Stream Mining Techniques