Rethinking Fano's Inequality in Ensemble Learning
Terufumi Morishita, Gaku Morio, Shota Horiguchi, Hiroaki Ozaki, Nobuo, Nukaga

TL;DR
This paper reexamines Fano's inequality in ensemble learning, introducing the concept of combination loss to better understand factors influencing ensemble system performance.
Contribution
It generalizes existing theory by incorporating information loss during prediction combination, providing a more accurate framework for ensemble system analysis.
Findings
Theoretical framework now accounts for combination loss.
Empirical validation demonstrates the theory's effectiveness.
Insights into system strengths and weaknesses on key metrics.
Abstract
We propose a fundamental theory on ensemble learning that answers the central question: what factors make an ensemble system good or bad? Previous studies used a variant of Fano's inequality of information theory and derived a lower bound of the classification error rate on the basis of the and of models. We revisit the original Fano's inequality and argue that the studies did not take into account the information lost when multiple model predictions are combined into a final prediction. To address this issue, we generalize the previous theory to incorporate the information loss, which we name . Further, we empirically validate and demonstrate the proposed theory through extensive experiments on actual systems. The theory reveals the strengths and weaknesses of systems on each metric, which will push the theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Neural Networks and Applications · Data Stream Mining Techniques
