No-regret incentive-compatible online learning under exact truthfulness with non-myopic experts
Junpei Komiyama, Nishant A. Mehta, Ali Mortazavi

TL;DR
This paper introduces the first no-regret, exactly truthful online mechanism for non-myopic experts, achieving low belief regret in forecasting, with novel tail bounds for Poisson binomial variables and extensions to bandit settings.
Contribution
It develops the first no-regret, exactly truthful mechanism for non-myopic experts in online forecasting, using an online I-ELF approach and new probabilistic tail bounds.
Findings
Achieves (\u007F\u221A T N) regret in full-information setting.
Extends to bandit setting with (T^{2/3} N^{1/3}) regret.
Introduces new tail bounds for Poisson binomial random variables.
Abstract
We study an online forecasting setting in which, over rounds, strategic experts each report a forecast to a mechanism, the mechanism selects one forecast, and then the outcome is revealed. In any given round, each expert has a belief about the outcome, but the expert wishes to select its report so as to maximize the total number of times it is selected. The goal of the mechanism is to obtain low belief regret: the difference between its cumulative loss (based on its selected forecasts) and the cumulative loss of the best expert in hindsight (as measured by the experts' beliefs). We consider exactly truthful mechanisms for non-myopic experts, meaning that truthfully reporting its belief strictly maximizes the expert's subjective probability of being selected in any future round. Even in the full-information setting, it is an open problem to obtain the first no-regret exactly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Cognitive Radio Networks and Spectrum Sensing
