Calibration and Internal no-Regret with Partial Monitoring

Vianney Perchet (EC)

arXiv:1006.1746·cs.GT·July 28, 2010·5 cites

Calibration and Internal no-Regret with Partial Monitoring

Vianney Perchet (EC)

PDF

Open Access

TL;DR

This paper explores the relationship between calibrated strategies and no internal regret strategies in partial monitoring games, providing methods to construct such strategies using approachability theory.

Contribution

It establishes a converse link between approachability of convex sets and calibrated strategies in the context of partial monitoring.

Findings

01

Strategies approaching convex B-sets can be derived from calibrated strategies.

02

The paper develops tools for constructing no internal regret strategies under partial monitoring.

03

It extends approachability and calibration concepts to games with incomplete information.

Abstract

Calibrated strategies can be obtained by performing strategies that have no internal regret in some auxiliary game. Such strategies can be constructed explicitly with the use of Blackwell's approachability theorem, in an other auxiliary game. We establish the converse: a strategy that approaches a convex $B$ -set can be derived from the construction of a calibrated strategy. We develop these tools in the framework of a game with partial monitoring, where players do not observe the actions of their opponents but receive random signals, to define a notion of internal regret and construct strategies that have no such regret.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Game Theory and Applications · Auction Theory and Applications