# Log-minor distributions and an application to estimating mean subsystem   entropy

**Authors:** Alice C. Schwarze, Philip S. Chodrow, and Mason A. Porter

arXiv: 1901.09456 · 2019-01-30

## TL;DR

This paper analyzes the distribution of log-determinants of principal submatrices of covariance matrices with bounded condition number, providing bounds that enable efficient estimation of subsystem entropy regardless of system size.

## Contribution

It introduces bounds on the distribution of minors and their variance, enabling accurate entropy estimation with sample sizes independent of system size.

## Key findings

- Sample size for entropy estimation is asymptotically independent of system size n.
- Number of samples needed scales linearly with subsystem size k.
- Derived bounds improve efficiency of entropy estimation in large systems.

## Abstract

A common task in physics, information theory, and other fields is the analysis of properties of subsystems of a given system. Given the covariance matrix $M$ of a system of $n$ coupled variables, the covariance matrices of the subsystems are principal submatrices of $M$. The rapid growth with $n$ of the set of principal submatrices makes it impractical to exhaustively study each submatrix for even modestly-sized systems. It is therefore of great interest to derive methods for approximating the distributions of important submatrix properties for a given matrix.   Motivated by the importance of differential entropy as a systemic measure of disorder, we study the distribution of log-determinants of principal $k\times k$ submatrices when the covariance matrix has bounded condition number. We derive upper bounds for the right tail and the variance of the distribution of minors, and we use these in turn to derive upper bounds on the standard error of the sample mean of subsystem entropy. Our results demonstrate that, despite the rapid growth of the set of subsystems with $n$, the number of samples that are needed to bound the sampling error is asymptotically independent of $n$. Instead, it is sufficient to increase the number of samples in linear proportion to $k$ to achieve a desired sampling accuracy.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.09456/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1901.09456/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/1901.09456/full.md

---
Source: https://tomesphere.com/paper/1901.09456