The use of entropy to measure structural diversity

L. Masisi; V. Nelwamondo; T. Marwala

arXiv:0810.3525·cs.LG·October 21, 2008

TL;DR

This paper compares entropy-based measures to quantify the structural diversity of classifier ensembles, demonstrating that higher diversity correlates with improved accuracy and using genetic algorithms to optimize ensemble composition.

Contribution

It introduces a novel application of entropy measures and information theory to assess and optimize the diversity of classifier ensembles.

Findings

01 Ensemble accuracy increased as the diversity indices increased.

02 Ensembles dominated by classifiers with the same structure produced poor accuracy.

03 Genetic algorithms, with the diversity indices as the cost function, were used to find the optimal ensemble.

Abstract

In this paper, entropy-based methods are compared and used to measure the structural diversity of an ensemble of 21 classifiers. Such measures are mostly applied in ecology, where species counts are used as a measure of diversity. The measures used were the Shannon entropy, Simpson, and Berger-Parker diversity indices. As the diversity indices increased, so did the accuracy of the ensemble. An ensemble dominated by classifiers with the same structure produced poor accuracy. The uncertainty rule from information theory was also used to further define diversity. Genetic algorithms were used to find the optimal ensemble, with the diversity indices as the cost function. Voting was used to aggregate the classifiers' decisions.
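
As a rough illustration of the three indices named in the abstract, the sketch below treats each distinct classifier structure as a "species" and counts how many of the 21 ensemble members share it. The function name `diversity_indices`, the structure labels, and the choice of the Gini-Simpson form (one minus the sum of squared proportions) and of the Berger-Parker dominance with its reciprocal are assumptions made for illustration; the paper may define the indices differently.

```python
import math
from collections import Counter


def diversity_indices(structures):
    """Compute diversity indices over a list of structure labels.

    `structures` holds one hashable label per classifier (hypothetical
    example: the number of hidden nodes of each network).
    """
    counts = Counter(structures)
    total = sum(counts.values())
    # Proportion of the ensemble occupied by each distinct structure.
    proportions = [c / total for c in counts.values()]

    # Shannon entropy (natural log): zero when all structures are identical.
    shannon = sum(-p * math.log(p) for p in proportions)
    # Gini-Simpson form (assumed): chance two random members differ.
    simpson = 1.0 - sum(p * p for p in proportions)
    # Berger-Parker dominance: share held by the most common structure.
    dominance = max(proportions)

    return {
        "shannon": shannon,
        "simpson": simpson,
        "berger_parker_dominance": dominance,
        # The reciprocal grows with diversity, unlike the dominance itself.
        "berger_parker_reciprocal": 1.0 / dominance,
    }


# Hypothetical ensembles of 21 classifiers, labelled by structure.
balanced = ["A"] * 7 + ["B"] * 7 + ["C"] * 7
dominated = ["A"] * 21

balanced_scores = diversity_indices(balanced)
dominated_scores = diversity_indices(dominated)
```

Under these assumed definitions, the balanced ensemble scores higher on Shannon entropy, the Simpson index, and the Berger-Parker reciprocal than the ensemble made of a single structure, which scores zero on the first two.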

Taxonomy

Topics: Product Development and Customization