Cooperative Thresholded Lasso for Sparse Linear Bandit
Haniyeh Barghi, Xiaotong Cheng, Setareh Maghsudi

TL;DR
This paper introduces a multi-agent sparse linear bandit algorithm that uses Lasso for feature selection, reduces communication costs, and achieves near-optimal regret bounds in high-dimensional settings.
Contribution
It proposes the first distributed algorithm for sparse linear bandits with row-wise data sharing, combining Lasso and ridge regression for efficient multi-agent learning.
Findings
Achieves regret bound of order O(s_0 log d + s_0 √T) with high probability.
Reduces communication costs while maintaining near-optimal regret performance.
Demonstrates effectiveness on synthetic and real-world datasets.
Abstract
We present a novel approach to address the multi-agent sparse contextual linear bandit problem, in which the feature vectors have a high dimension whereas the reward function depends on only a limited set of features - precisely . Furthermore, the learning follows under information-sharing constraints. The proposed method employs Lasso regression for dimension reduction, allowing each agent to independently estimate an approximate set of main dimensions and share that information with others depending on the network's structure. The information is then aggregated through a specific process and shared with all agents. Each agent then resolves the problem with ridge regression focusing solely on the extracted dimensions. We represent algorithms for both a star-shaped network and a peer-to-peer network. The approaches effectively reduce communication costs while ensuring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Age of Information Optimization · Data Stream Mining Techniques
