Socially-Optimal Mechanism Design for Incentivized Online Learning

Zhiyuan Wang; Lin Gao; Jianwei Huang

arXiv:2112.14338·cs.GT·December 30, 2021

Socially-Optimal Mechanism Design for Incentivized Online Learning

Zhiyuan Wang, Lin Gao, Jianwei Huang

PDF

Open Access

TL;DR

This paper introduces a socially-optimal incentive mechanism for online learning scenarios involving selfish agents, ensuring fairness and incentive compatibility while achieving near-optimal social performance in applications like edge computing.

Contribution

It develops a novel incentivized online learning framework with a mechanism that guarantees fairness, incentive compatibility, and voluntary participation, approaching the theoretical social performance bound.

Findings

01

Mechanism achieves asymptotic performance matching state-of-the-art benchmarks.

02

Larger agent crowds improve the mechanism's social performance.

03

Numerical results confirm advantages in large-scale edge computing.

Abstract

Multi-arm bandit (MAB) is a classic online learning framework that studies the sequential decision-making in an uncertain environment. The MAB framework, however, overlooks the scenario where the decision-maker cannot take actions (e.g., pulling arms) directly. It is a practically important scenario in many applications such as spectrum sharing, crowdsensing, and edge computing. In these applications, the decision-maker would incentivize other selfish agents to carry out desired actions (i.e., pulling arms on the decision-maker's behalf). This paper establishes the incentivized online learning (IOL) framework for this scenario. The key challenge to design the IOL framework lies in the tight coupling of the unknown environment learning and asymmetric information revelation. To address this, we construct a special Lagrangian function based on which we propose a socially-optimal mechanism…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing · Data Stream Mining Techniques