Agglomerative Info-Clustering

Chung Chan; Ali Al-Bashabsheh; Qiaoqiao Zhou

arXiv:1701.04926·cs.IT·February 27, 2017·1 cites

Agglomerative Info-Clustering

Chung Chan, Ali Al-Bashabsheh, Qiaoqiao Zhou

PDF

Open Access

TL;DR

This paper introduces an agglomerative info-clustering method for random variables that merges clusters based on maximum multivariate mutual information, enabling earlier stopping and improved efficiency.

Contribution

It presents a novel agglomerative clustering algorithm for info-clustering that leverages submodularity and duality, offering computational advantages over previous methods.

Findings

01

Efficient algorithm derived using submodularity of entropy

02

Allows earlier stopping when desired cluster size and accuracy are reached

03

Improves computational efficiency over existing info-clustering methods

Abstract

An agglomerative clustering of random variables is proposed, where clusters of random variables sharing the maximum amount of multivariate mutual information are merged successively to form larger clusters. Compared to the previous info-clustering algorithms, the agglomerative approach allows the computation to stop earlier when clusters of desired size and accuracy are obtained. An efficient algorithm is also derived based on the submodularity of entropy and the duality between the principal sequence of partitions and the principal sequence for submodular functions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Data Management and Algorithms