Distributed Compression of Graphical Data

Payam Delgosha; Venkat Anantharam

arXiv:1802.07446·cs.IT·August 24, 2021

Distributed Compression of Graphical Data

Payam Delgosha, Venkat Anantharam

PDF

TL;DR

This paper develops a theoretical framework for the distributed compression of large-scale graphical data, extending classical information theory results to complex graph models like Erdos-Renyi and configuration models.

Contribution

It derives a rate region for distributed graph data compression, generalizing the Slepian-Wolf theorem to new graph-based models and multiple sources.

Findings

01

Characterizes the rate region for two types of graphical data models.

02

Generalizes the rate region to multiple sources.

03

Connects graph entropy with local weak limits of sparse graphs.

Abstract

In contrast to time series, graphical data is data indexed by the vertices and edges of a graph. Modern applications such as the internet, social networks, genomics and proteomics generate graphical data, often at large scale. The large scale argues for the need to compress such data for storage and subsequent processing. Since this data might have several components available in different locations, it is also important to study distributed compression of graphical data. In this paper, we derive a rate region for this problem which is a counterpart of the Slepian-Wolf theorem. We characterize the rate region when the statistical description of the distributed graphical data can be modeled as being one of two types - as a member of a sequence of marked sparse Erdos-Renyi ensembles or as a member of a sequence of marked configuration model ensembles. Our results are in terms of a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.