Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization
Young-Geun Choi, Gi-Soo Kim, Seunghoon Paik, Myunghee Cho Paik

TL;DR
This paper introduces SemiGraphTS, a novel Thompson-sampling algorithm for semi-parametric contextual bandits with graph structures, addressing non-stationarity and social interactions in human behavior.
Contribution
It is the first to propose a graph-based semi-parametric contextual bandit algorithm, deriving regret bounds that incorporate graph structure and model order.
Findings
SemiGraphTS outperforms existing algorithms in simulations.
The regret bound depends on graph structure and model order.
Real data experiments validate the effectiveness of the proposed method.
Abstract
Non-stationarity is ubiquitous in human behavior and addressing it in the contextual bandits is challenging. Several works have addressed the problem by investigating semi-parametric contextual bandits and warned that ignoring non-stationarity could harm performances. Another prevalent human behavior is social interaction which has become available in a form of a social network or graph structure. As a result, graph-based contextual bandits have received much attention. In this paper, we propose "SemiGraphTS," a novel contextual Thompson-sampling algorithm for a graph-based semi-parametric reward model. Our algorithm is the first to be proposed in this setting. We derive an upper bound of the cumulative regret that can be expressed as a multiple of a factor depending on the graph structure and the order for the semi-parametric model without a graph. We evaluate the proposed and existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · COVID-19 epidemiological studies · Advanced Causal Inference Techniques
