Distributed Compression of Graphical Data
Payam Delgosha, Venkat Anantharam

TL;DR
This paper develops a theoretical framework for the distributed compression of large-scale graphical data, extending classical information theory results to complex graph models like Erdos-Renyi and configuration models.
Contribution
It derives a rate region for distributed graph data compression, generalizing the Slepian-Wolf theorem to new graph-based models and multiple sources.
Findings
Characterizes the rate region for two types of graphical data models.
Generalizes the rate region to multiple sources.
Connects graph entropy with local weak limits of sparse graphs.
Abstract
In contrast to time series, graphical data is data indexed by the vertices and edges of a graph. Modern applications such as the internet, social networks, genomics and proteomics generate graphical data, often at large scale. The large scale argues for the need to compress such data for storage and subsequent processing. Since this data might have several components available in different locations, it is also important to study distributed compression of graphical data. In this paper, we derive a rate region for this problem which is a counterpart of the Slepian-Wolf theorem. We characterize the rate region when the statistical description of the distributed graphical data can be modeled as being one of two types - as a member of a sequence of marked sparse Erdos-Renyi ensembles or as a member of a sequence of marked configuration model ensembles. Our results are in terms of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
