What game are we playing? End-to-end learning in normal and extensive   form games

Chun Kai Ling; Fei Fang; J. Zico Kolter

arXiv:1805.02777·cs.LG·June 29, 2018

What game are we playing? End-to-end learning in normal and extensive form games

Chun Kai Ling, Fei Fang, J. Zico Kolter

PDF

1 Repo

TL;DR

This paper introduces a differentiable, end-to-end learning framework for inferring unknown game parameters from observations, enabling neural networks to learn and solve complex normal and extensive form games like poker and security scenarios.

Contribution

It develops a primal-dual Newton method and a backpropagation approach to compute gradients through game solutions, facilitating end-to-end learning of game parameters.

Findings

01

Effective in poker and security game tasks

02

Enables learning game parameters from observations

03

Integrates differentiable game solvers into deep networks

Abstract

Although recent work in AI has made great progress in solving large, zero-sum, extensive-form games, the underlying assumption in most past work is that the parameters of the game itself are known to the agents. This paper deals with the relatively under-explored but equally important "inverse" setting, where the parameters of the underlying game are not known to all agents, but must be learned through observations. We propose a differentiable, end-to-end learning framework for addressing this task. In particular, we consider a regularized version of the game, equivalent to a particular form of quantal response equilibrium, and develop 1) a primal-dual Newton method for finding such equilibrium points in both normal and extensive form games; and 2) a backpropagation method that lets us analytically compute gradients of all relevant game parameters through the solution itself. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lingchunkai/payoff_learning
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.