No-Regret Bayesian Recommendation to Homogeneous Users
Yiding Feng, Wei Tang, Haifeng Xu

TL;DR
This paper develops online Bayesian recommendation strategies for platforms that observe product states and aim to persuade users while learning preferences, achieving near-optimal regret bounds in an online setting.
Contribution
It introduces no-regret online Bayesian recommendation policies with provable regret bounds and formulates the problem as a linear program for improved performance.
Findings
Achieved double logarithmic regret in the number of rounds.
Established a lower bound on regret dependence on rounds.
Designed a linear programming-based policy with polynomial state dependence.
Abstract
We introduce and study the online Bayesian recommendation problem for a recommender system platform. The platform has the privilege to privately observe a utility-relevant \emph{state} of a product at each round and uses this information to make online recommendations to a stream of myopic users. This paradigm is common in a wide range of scenarios in the current Internet economy. The platform commits to an online recommendation policy that utilizes her information advantage on the product state to persuade self-interested users to follow the recommendation. Since the platform does not know users' preferences or beliefs in advance, we study the platform's online learning problem of designing an adaptive recommendation policy to persuade users while gradually learning users' preferences and beliefs en route. Specifically, we aim to design online learning policies with no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Auction Theory and Applications
