Networked Information Aggregation for Binary Classification
MohammadHossein Bateni, Zahra Hadizadeh, MohammadTaghi Hajiaghayi, Mahdi JafariRaviz, Shayan Taherijam

TL;DR
This paper analyzes how a sequential, networked logistic regression protocol on a DAG can aggregate information, identifying depth as a key bottleneck for achieving low excess loss.
Contribution
It extends previous linear regression results to classification, providing bounds on excess loss and highlighting the impact of network depth on information aggregation.
Findings
Excess loss is bounded by $O(M/\sqrt{D})$ under certain feature observation conditions.
Instances exist with excess loss at least $\Omega(k/D)$, showing a fundamental bottleneck.
Network depth critically limits the effectiveness of distributed logistic regression.
Abstract
We study networked binary classification on a directed acyclic graph (DAG) where each agent observes only a subset of the feature columns of a shared dataset. Agents act sequentially along the DAG: each receives prediction columns from its parents (if any), augments its local features with these columns, fits a logistic predictor by minimizing binary cross-entropy (BCE), and forwards its prediction column to its outgoing neighbors. We ask whether this sequential distributed training procedure achieves information aggregation, meaning that some agent attains small excess loss compared to the best logistic predictor trained with access to all feature columns. This question was studied for linear regression under squared loss by Kearns, Roth, and Ryu (SODA 2026). Extending their guarantees to classification is nontrivial because their analysis relies on quadratic structure that does not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
