# Geometric Estimation of Multivariate Dependency

**Authors:** Salimeh Yasaei Sekeh, Alfred O. Hero

arXiv: 1905.08594 · 2019-10-02

## TL;DR

This paper introduces a scalable geometric estimator for dependency between multivariate samples based on minimal spanning trees, which converges to a new measure called geometric mutual information, avoiding density estimation.

## Contribution

It proposes a novel geometric dependency estimator using MSTs that converges to GMI, a divergence measure equivalent to Henze-Penrose divergence, with proven asymptotic properties.

## Key findings

- Estimator converges to geometric mutual information (GMI).
- Method is scalable and does not require density estimation.
- Experimental results demonstrate advantages over existing methods.

## Abstract

This paper proposes a geometric estimator of dependency between a pair of multivariate samples. The proposed estimator of dependency is based on a randomly permuted geometric graph (the minimal spanning tree) over the two multivariate samples. This estimator converges to a quantity that we call the geometric mutual information (GMI), which is equivalent to the Henze-Penrose divergence [1] between the joint distribution of the multivariate samples and the product of the marginals. The GMI has many of the same properties as standard MI but can be estimated from empirical data without density estimation; making it scalable to large datasets. The proposed empirical estimator of GMI is simple to implement, involving the construction of an MST spanning over both the original data and a randomly permuted version of this data. We establish asymptotic convergence of the estimator and convergence rates of the bias and variance for smooth multivariate density functions belonging to a H\"{o}lder class. We demonstrate the advantages of our proposed geometric dependency estimator in a series of experiments.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.08594/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1905.08594/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/1905.08594/full.md

---
Source: https://tomesphere.com/paper/1905.08594