Generalized Low-Rank Matrix Contextual Bandits with Graph Information

Yao Wang; Jiannan Li; Yue Kang; Shanxing Gao; Zhenxin Xiao

arXiv:2507.17528·cs.LG·July 24, 2025

Generalized Low-Rank Matrix Contextual Bandits with Graph Information

Yao Wang, Jiannan Li, Yue Kang, Shanxing Gao, Zhenxin Xiao

PDF

Open Access

TL;DR

This paper introduces a novel matrix contextual bandit algorithm that effectively combines low-rank structures with graph information, improving decision-making performance in applications like recommender systems.

Contribution

It proposes a unified framework integrating low-rank and graph data into matrix CB, with theoretical regret bounds and practical validation.

Findings

01

Outperforms existing methods in cumulative regret bounds.

02

Effectively utilizes graph information for better decision policies.

03

Validated on synthetic and real-world datasets.

Abstract

The matrix contextual bandit (CB), as an extension of the well-known multi-armed bandit, is a powerful framework that has been widely applied in sequential decision-making scenarios involving low-rank structure. In many real-world scenarios, such as online advertising and recommender systems, additional graph information often exists beyond the low-rank structure, that is, the similar relationships among users/items can be naturally captured through the connectivity among nodes in the corresponding graphs. However, existing matrix CB methods fail to explore such graph information, and thereby making them difficult to generate effective decision-making policies. To fill in this void, we propose in this paper a novel matrix CB algorithmic framework that builds upon the classical upper confidence bound (UCB) framework. This new framework can effectively integrate both the low-rank…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management