Normalized Mutual Information to evaluate overlapping community finding   algorithms

Aaron F. McDaid; Derek Greene; Neil Hurley

arXiv:1110.2515·physics.soc-ph·August 5, 2013·231 cites

Normalized Mutual Information to evaluate overlapping community finding algorithms

Aaron F. McDaid, Derek Greene, Neil Hurley

PDF

Open Access 5 Repos

TL;DR

This paper evaluates the use of normalized mutual information for measuring the accuracy of overlapping community detection algorithms, highlighting issues with current normalization methods and proposing improvements.

Contribution

It identifies problems with existing normalized mutual information measures and proposes a corrected normalization approach for better accuracy in evaluating overlapping clustering.

Findings

01

Normalized mutual information can behave unintuitively with current normalization.

02

A more conventional normalization improves the measure's behavior.

03

Comparison with Omega index shows the effectiveness of the proposed normalization.

Abstract

Given the increasing popularity of algorithms for overlapping clustering, in particular in social network analysis, quantitative measures are needed to measure the accuracy of a method. Given a set of true clusters, and the set of clusters found by an algorithm, these sets of clusters must be compared to see how similar or different the sets are. A normalized measure is desirable in many contexts, for example assigning a value of 0 where the two sets are totally dissimilar, and 1 where they are identical. A measure based on normalized mutual information, [1], has recently become popular. We demonstrate unintuitive behaviour of this measure, and show how this can be corrected by using a more conventional normalization. We compare the results to that of other measures, such as the Omega index [2].

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Network Analysis Techniques · Advanced Clustering Algorithms Research · Opinion Dynamics and Social Influence