Estimating $\alpha$-Rank by Maximizing Information Gain

Tabish Rashid; Cheng Zhang; Kamil Ciosek

arXiv:2101.09178·cs.MA·January 25, 2021

Estimating $\alpha$-Rank by Maximizing Information Gain

Tabish Rashid, Cheng Zhang, Kamil Ciosek

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces an efficient sampling method to estimate the $eta$-rank in unknown games by maximizing information gain, reducing the number of samples needed and incorporating prior knowledge.

Contribution

It proposes a novel Bayesian algorithm that maximizes information gain to estimate $eta$-rank with fewer samples and integrates prior assumptions about game payoffs.

Findings

01

Outperforms ResponseGraphUCB in sample efficiency

02

Provides theoretical guarantees for the estimation method

03

Focuses sampling on critical game entries

Abstract

Game theory has been increasingly applied in settings where the game is not known outright, but has to be estimated by sampling. For example, meta-games that arise in multi-agent evaluation can only be accessed by running a succession of expensive experiments that may involve simultaneous deployment of several agents. In this paper, we focus on $α$ -rank, a popular game-theoretic solution concept designed to perform well in such scenarios. We aim to estimate the $α$ -rank of the game using as few samples as possible. Our algorithm maximizes information gain between an epistemic belief over the $α$ -ranks and the observed payoff. This approach has two main benefits. First, it allows us to focus our sampling on the entries that matter the most for identifying the $α$ -rank. Second, the Bayesian formulation provides a facility to build in modeling assumptions by using a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/InfoGainalpharank
noneOfficial

Videos

Estimating alpha-Rank by Maximizing Information Gain· underline

Taxonomy

TopicsNeural Networks and Applications