Generalized Low-Rank Matrix Contextual Bandits with Graph Information
Yao Wang, Jiannan Li, Yue Kang, Shanxing Gao, Zhenxin Xiao

TL;DR
This paper introduces a novel matrix contextual bandit algorithm that effectively combines low-rank structures with graph information, improving decision-making performance in applications like recommender systems.
Contribution
It proposes a unified framework integrating low-rank and graph data into matrix CB, with theoretical regret bounds and practical validation.
Findings
Outperforms existing methods in cumulative regret bounds.
Effectively utilizes graph information for better decision policies.
Validated on synthetic and real-world datasets.
Abstract
The matrix contextual bandit (CB), as an extension of the well-known multi-armed bandit, is a powerful framework that has been widely applied in sequential decision-making scenarios involving low-rank structure. In many real-world scenarios, such as online advertising and recommender systems, additional graph information often exists beyond the low-rank structure, that is, the similar relationships among users/items can be naturally captured through the connectivity among nodes in the corresponding graphs. However, existing matrix CB methods fail to explore such graph information, and thereby making them difficult to generate effective decision-making policies. To fill in this void, we propose in this paper a novel matrix CB algorithmic framework that builds upon the classical upper confidence bound (UCB) framework. This new framework can effectively integrate both the low-rank…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management
