Hierarchical Clustering Based on Mutual Information
Alexander Kraskov, Harald St\"ogbauer, Ralph G. Andrzejak, and Peter, Grassberger

TL;DR
This paper introduces a hierarchical clustering method called mutual information clustering (MIC) that uses mutual information as a similarity measure, applicable to probabilistic and algorithmic data, demonstrated on phylogenetic trees and fetal ECG reconstruction.
Contribution
The paper presents a novel MIC algorithm leveraging mutual information's grouping property for hierarchical clustering in bioinformatics.
Findings
Successfully reconstructed mammal phylogenetic trees from mitochondrial DNA.
Reconstructed fetal ECG from ICA outputs.
Provided software tools for MI estimation and clustering.
Abstract
Motivation: Clustering is a frequently used concept in variety of bioinformatical applications. We present a new method for hierarchical clustering of data called mutual information clustering (MIC) algorithm. It uses mutual information (MI) as a similarity measure and exploits its grouping property: The MI between three objects X, Y, and Z is equal to the sum of the MI between X and Y, plus the MI between Z and the combined object (XY). Results: We use this both in the Shannon (probabilistic) version of information theory, where the "objects" are probability distributions represented by random samples, and in the Kolmogorov (algorithmic) version, where the "objects" are symbol sequences. We apply our method to the construction of mammal phylogenetic trees from mitochondrial DNA sequences and we reconstruct the fetal ECG from the output of independent components analysis (ICA) applied…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · Machine Learning in Bioinformatics · Genomics and Phylogenetic Studies
