Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls   Oracle

Enoch Hyunwook Kang; P. R. Kumar

arXiv:2301.12571·cs.LG·July 3, 2023·1 cites

Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle

Enoch Hyunwook Kang, P. R. Kumar

PDF

Open Access

TL;DR

This paper develops a theoretical framework for bounded regret in recommendation systems using synthetic control methods, addressing practical issues like unobservable covariates and user privacy, and verifies results through simulations.

Contribution

It introduces a novel approach leveraging synthetic control methods to achieve bounded regret without requiring exact linear model knowledge in recommender systems.

Findings

01

Bounded regret achieved under practical assumptions

02

Synthetic control methods effectively relax linear model requirements

03

Simulation confirms theoretical bounded regret results

Abstract

In online exploration systems where users with fixed preferences repeatedly arrive, it has recently been shown that O(1), i.e., bounded regret, can be achieved when the system is modeled as a linear contextual bandit. This result may be of interest for recommender systems, where the popularity of their items is often short-lived, as the exploration itself may be completed quickly before potential long-run non-stationarities come into play. However, in practice, exact knowledge of the linear model is difficult to justify. Furthermore, potential existence of unobservable covariates, uneven user arrival rates, interpretation of the necessary rank condition, and users opting out of private data tracking all need to be addressed for practical recommender system applications. In this work, we conduct a theoretical study to address all these issues while still achieving bounded regret. Aside…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Age of Information Optimization