Incentive design for learning in user-recommendation systems with   time-varying states

Deepanshu Vasal; Vijay Subramanian; Achilleas Anastasopoulos

arXiv:1804.05083·cs.GT·April 17, 2018

Incentive design for learning in user-recommendation systems with time-varying states

Deepanshu Vasal, Vijay Subramanian, Achilleas Anastasopoulos

PDF

TL;DR

This paper studies how to design incentives in user-recommendation systems to encourage truthful information sharing among strategic users with private signals, enabling effective learning of a time-varying state.

Contribution

It formulates the problem as a decentralized stochastic control problem and proposes incentive schemes that align user incentives with the system's learning objectives.

Findings

01

Optimal policies for the team problem are characterized.

02

Incentive schemes ensure truthful signal revelation with minimal payments.

03

The gap between strategic and team objectives is kept small.

Abstract

We consider the problem of how strategic users with asymmetric information can learn an underlying time varying state in a user-recommendation system. Users who observe private signals about the state, sequentially make a decision about buying a product whose value varies with time in an ergodic manner. We formulate the team problem as an instance of decentralized stochastic control problem and characterize its optimal policies. With strategic users, we design incentives such that users reveal their true private signals, so that the gap between the strategic and team objective is small and the overall expected incentive payments are also small.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.