Networked Information Aggregation via Machine Learning
Michael Kearns, Aaron Roth, Emily Ryu

TL;DR
This paper investigates how distributed agents in a DAG can collectively learn to predict labels as accurately as if they had access to all features, analyzing the role of network structure and feature distribution.
Contribution
It provides theoretical bounds and insights on when information aggregation is possible in distributed learning over DAGs, highlighting the importance of graph depth.
Findings
Depth of the DAG is crucial for successful information aggregation.
Aggregation can fail in shallow or hub-and-spokes topologies.
Theoretical bounds are supported by extensive experiments.
Abstract
We study a distributed learning problem in which learning agents are embedded in a directed acyclic graph (DAG). There is a fixed and arbitrary distribution over feature/label pairs, and each agent or vertex in the graph is able to directly observe only a subset of the features -- potentially a different subset for every agent. The agents learn sequentially in some order consistent with a topological sort of the DAG, committing to a model mapping observations to predictions of the real-valued label. Each agent observes the predictions of their parents in the DAG, and trains their model using both the features of the instance that they directly observe, and the predictions of their parents as additional features. We ask when this process is sufficient to achieve \emph{information aggregation}, in the sense that some agent in the DAG is able to learn a model whose error is competitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting
