Batched Adaptive Network Formation
Yan Xu, Bo Zhou

TL;DR
This paper introduces an online adaptive network formation policy that learns to optimize network output over time using a weighted stochastic block model, balancing exploration and exploitation in a batched multi-armed bandit framework.
Contribution
It develops a novel batched multi-armed bandit approach based on the WSBM, with theoretical foundations and practical algorithms for adaptive network formation.
Findings
Significant improvement in network outcomes within few batches
Accurate parameter estimation through variational methods
Effective in nonstationary, evolving agent pools
Abstract
Networks are central to many economic and organizational applications, including workplace team formation, social platform recommendations, and classroom friendship development. In these settings, networks are modeled as graphs, with agents as nodes, agent pairs as edges, and edge weights capturing pairwise production or interaction outcomes. This paper develops an adaptive, or \textit{online}, policy that learns to form increasingly effective networks as data accumulates over time, progressively improving total network output measured by the sum of edge weights. Our approach builds on the weighted stochastic block model (WSBM), which captures agents' unobservable heterogeneity through discrete latent types and models their complementarities in a flexible, nonparametric manner. We frame the online network formation problem as a non-standard \textit{batched multi-armed bandit}, where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Complex Network Analysis Techniques
