Graph Neural Thompson Sampling
Shuang Wu, Arash A. Amini

TL;DR
This paper introduces GNN-TS, a graph neural network-based Thompson Sampling algorithm for online decision-making on graph data, achieving state-of-the-art regret bounds and demonstrating competitive empirical performance.
Contribution
The paper proposes GNN-TS, a novel GNN-powered Thompson Sampling algorithm for graph action bandits, with theoretical regret guarantees and scalable empirical results.
Findings
Achieves sub-linear regret of order d7((d7d T)^{1/2})
Regret bound is independent of the number of graph nodes
Empirical results show competitive performance and scalability
Abstract
We consider an online decision-making problem with a reward function defined over graph-structured data. We formally formulate the problem as an instance of graph action bandit. We then propose \texttt{GNN-TS}, a Graph Neural Network (GNN) powered Thompson Sampling (TS) algorithm which employs a GNN approximator for estimating the mean reward function and the graph neural tangent features for uncertainty estimation. We prove that, under certain boundness assumptions on the reward function, GNN-TS achieves a state-of-the-art regret bound which is (1) sub-linear of order in the number of interaction rounds, , and a notion of effective dimension , and (2) independent of the number of graph nodes. Empirical results validate that our proposed \texttt{GNN-TS} exhibits competitive performance and scales well on graph action bandit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and ELM
MethodsGraph Neural Network
