Learning Data Dependency with Communication Cost
Hyeryung Jang, HyungSeok Song, Yung Yi

TL;DR
This paper explores the balance between learning accuracy and communication costs in distributed data dependency graph learning for inference, proposing optimal and heuristic algorithms with theoretical analysis and validation.
Contribution
It introduces a formal optimization framework for data dependency graph learning that accounts for message-passing costs and provides algorithms for different inference mechanisms.
Findings
Optimal polynomial-time algorithm for ASYNC-MAP
NP-hardness of SYNC-MAP with a greedy heuristic
Decay rate of graph difference probability with data samples
Abstract
In this paper, we consider the problem of recovering a graph that represents the statistical data dependency among nodes for a set of data samples generated by nodes, which provides the basic structure to perform an inference task, such as MAP (maximum a posteriori). This problem is referred to as structure learning. When nodes are spatially separated in different locations, running an inference algorithm requires a non-negligible amount of message passing, incurring some communication cost. We inevitably have the trade-off between the accuracy of structure learning and the cost we need to pay to perform a given message-passing based inference task because the learnt edge structures of data dependency and physical connectivity graph are often highly different. In this paper, we formalize this trade-off in an optimization problem which outputs the data dependency graph that jointly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Distributed Sensor Networks and Detection Algorithms · Advanced Graph Neural Networks
