Dynamic Online Recommendation for Two-Sided Market with Bayesian Incentive Compatibility
Yuantong Li, Guang Cheng, Xiaowu Dai

TL;DR
This paper introduces a dynamic recommendation protocol that balances exploration and exploitation while ensuring incentive compatibility, validated through theoretical guarantees and real-world experiments.
Contribution
It formalizes the challenge of incentive-compatible online recommendations and proposes a novel two-stage algorithm with proven regret bounds and incentive guarantees.
Findings
Achieves $O( oot{K}{d}T^{1/2})$ regret bound.
Ensures Bayesian incentive compatibility under Gaussian priors.
Demonstrates strong incentive and regret performance in simulations and real-world data.
Abstract
Recommender systems play a crucial role in internet economies by connecting users with relevant products or services. However, designing effective recommender systems faces two key challenges: (1) the exploration-exploitation tradeoff in balancing new product exploration against exploiting known preferences, and (2) dynamic incentive compatibility in accounting for users' self-interested behaviors and heterogeneous preferences. This paper formalizes these challenges into a Dynamic Bayesian Incentive-Compatible Recommendation Protocol (DBICRP). To address the DBICRP, we propose a two-stage algorithm (RCB) that integrates incentivized exploration with an efficient offline learning component for exploitation. In the first stage, our algorithm explores available products while maintaining dynamic incentive compatibility to determine sufficient sample sizes. The second stage employs inverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Consumer Market Behavior and Pricing · Advanced Bandit Algorithms Research
