Partitioning Relational Matrices of Similarities or Dissimilarities   using the Value of Information

Isaac J. Sledge; Jose C. Principe

arXiv:1710.10381·cs.AI·October 31, 2017

Partitioning Relational Matrices of Similarities or Dissimilarities using the Value of Information

Isaac J. Sledge, Jose C. Principe

PDF

TL;DR

This paper introduces an information-theoretic clustering method for relational matrices that adaptively determines the number of clusters without prior specification, using a deterministic annealing approach.

Contribution

It presents a novel clustering technique based on the value of information, enabling data-driven determination of cluster number and improved partitioning of similarity or dissimilarity matrices.

Findings

01

Automatically determines the number of clusters during annealing.

02

Identifies the global-best partition effectively.

03

Avoids the need for pre-specifying cluster count.

Abstract

In this paper, we provide an approach to clustering relational matrices whose entries correspond to either similarities or dissimilarities between objects. Our approach is based on the value of information, a parameterized, information-theoretic criterion that measures the change in costs associated with changes in information. Optimizing the value of information yields a deterministic annealing style of clustering with many benefits. For instance, investigators avoid needing to a priori specify the number of clusters, as the partitions naturally undergo phase changes, during the annealing process, whereby the number of clusters changes in a data-driven fashion. The global-best partition can also often be identified.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.