The use of entropy to measure structural diversity
L. Masisi, V. Nelwamondo, T. Marwala

TL;DR
This paper compares entropy-based measures to quantify the structural diversity of classifier ensembles, demonstrating that higher diversity correlates with improved accuracy and using genetic algorithms to optimize ensemble composition.
Contribution
It introduces a novel application of entropy measures and information theory to assess and optimize the diversity of classifier ensembles.
Findings
Higher diversity indexes lead to increased ensemble accuracy.
Ensembles with similar classifiers perform poorly.
Genetic algorithms effectively optimize ensemble diversity.
Abstract
In this paper entropy based methods are compared and used to measure structural diversity of an ensemble of 21 classifiers. This measure is mostly applied in ecology, whereby species counts are used as a measure of diversity. The measures used were Shannon entropy, Simpsons and the Berger Parker diversity indexes. As the diversity indexes increased so did the accuracy of the ensemble. An ensemble dominated by classifiers with the same structure produced poor accuracy. Uncertainty rule from information theory was also used to further define diversity. Genetic algorithms were used to find the optimal ensemble by using the diversity indices as the cost function. The method of voting was used to aggregate the decisions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProduct Development and Customization
