# Bayesian contiguity constrained clustering, spanning trees and   dendrograms

**Authors:** Etienne C\^ome

arXiv: 2302.12546 · 2023-02-27

## TL;DR

This paper introduces a Bayesian approach to contiguity-constrained clustering using spanning trees, enabling exact posterior probability computation, MAP partitioning, and Bayesian dendrogram extraction, with real-world applications demonstrated.

## Contribution

It formalizes contiguity-constrained clustering in a Bayesian framework and develops algorithms for MAP partitioning and dendrogram extraction based on spanning trees.

## Key findings

- Exact posterior probabilities of partitions computed using spanning trees
- Algorithm for finding MAP partitions in the Bayesian setting
- Demonstrated effectiveness on real-world datasets

## Abstract

Clustering is a well-known and studied problem, one of its variants, called contiguity-constrained clustering, accepts as a second input a graph used to encode prior information about cluster structure by means of contiguity constraints i.e. clusters must form connected subgraphs of this graph. This paper discusses the interest of such a setting and proposes a new way to formalise it in a Bayesian setting, using results on spanning trees to compute exactly a posteriori probabilities of candidate partitions. An algorithmic solution is then investigated to find a maximum a posteriori (MAP) partition and extract a Bayesian dendrogram from it. The interest of this last tool, which is reminiscent of the classical output of a simple hierarchical clustering algorithm, is analysed. Finally, the proposed approach is demonstrated with real applications. A reference implementation of this work is available in the R package gtclust that accompanies the paper (available at http://github.com/comeetie/gtclust)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2302.12546/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/2302.12546/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/2302.12546/full.md

---
Source: https://tomesphere.com/paper/2302.12546