Socially-Optimal Mechanism Design for Incentivized Online Learning
Zhiyuan Wang, Lin Gao, Jianwei Huang

TL;DR
This paper introduces a socially-optimal incentive mechanism for online learning scenarios involving selfish agents, ensuring fairness and incentive compatibility while achieving near-optimal social performance in applications like edge computing.
Contribution
It develops a novel incentivized online learning framework with a mechanism that guarantees fairness, incentive compatibility, and voluntary participation, approaching the theoretical social performance bound.
Findings
Mechanism achieves asymptotic performance matching state-of-the-art benchmarks.
Larger agent crowds improve the mechanism's social performance.
Numerical results confirm advantages in large-scale edge computing.
Abstract
Multi-arm bandit (MAB) is a classic online learning framework that studies the sequential decision-making in an uncertain environment. The MAB framework, however, overlooks the scenario where the decision-maker cannot take actions (e.g., pulling arms) directly. It is a practically important scenario in many applications such as spectrum sharing, crowdsensing, and edge computing. In these applications, the decision-maker would incentivize other selfish agents to carry out desired actions (i.e., pulling arms on the decision-maker's behalf). This paper establishes the incentivized online learning (IOL) framework for this scenario. The key challenge to design the IOL framework lies in the tight coupling of the unknown environment learning and asymmetric information revelation. To address this, we construct a special Lagrangian function based on which we propose a socially-optimal mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing · Data Stream Mining Techniques
