No-regret incentive-compatible online learning under exact truthfulness   with non-myopic experts

Junpei Komiyama; Nishant A. Mehta; Ali Mortazavi

arXiv:2502.11483·cs.LG·February 18, 2025

No-regret incentive-compatible online learning under exact truthfulness with non-myopic experts

Junpei Komiyama, Nishant A. Mehta, Ali Mortazavi

PDF

Open Access

TL;DR

This paper introduces the first no-regret, exactly truthful online mechanism for non-myopic experts, achieving low belief regret in forecasting, with novel tail bounds for Poisson binomial variables and extensions to bandit settings.

Contribution

It develops the first no-regret, exactly truthful mechanism for non-myopic experts in online forecasting, using an online I-ELF approach and new probabilistic tail bounds.

Findings

01

Achieves (\u007F\u221A T N) regret in full-information setting.

02

Extends to bandit setting with (T^{2/3} N^{1/3}) regret.

03

Introduces new tail bounds for Poisson binomial random variables.

Abstract

We study an online forecasting setting in which, over $T$ rounds, $N$ strategic experts each report a forecast to a mechanism, the mechanism selects one forecast, and then the outcome is revealed. In any given round, each expert has a belief about the outcome, but the expert wishes to select its report so as to maximize the total number of times it is selected. The goal of the mechanism is to obtain low belief regret: the difference between its cumulative loss (based on its selected forecasts) and the cumulative loss of the best expert in hindsight (as measured by the experts' beliefs). We consider exactly truthful mechanisms for non-myopic experts, meaning that truthfully reporting its belief strictly maximizes the expert's subjective probability of being selected in any future round. Even in the full-information setting, it is an open problem to obtain the first no-regret exactly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Cognitive Radio Networks and Spectrum Sensing