Learning to Play Stochastic Two-player Perfect-Information Games without   Knowledge

Quentin Cohen-Solal; Tristan Cazenave

arXiv:2302.04318·cs.AI·February 10, 2023

Learning to Play Stochastic Two-player Perfect-Information Games without Knowledge

Quentin Cohen-Solal, Tristan Cazenave

PDF

Open Access

TL;DR

This paper extends the Descent framework to stochastic two-player perfect-information games, proposing two methods and demonstrating superior performance of the generalized approach in the game EinStein wurfelt nicht! compared to existing algorithms.

Contribution

The paper introduces a novel generalization of the Descent algorithm to stochastic games and compares it with an approximation method, advancing learning without prior knowledge.

Findings

01

The generalized Descent algorithm outperforms Expectiminimax and Polygames in the tested game.

02

The deterministic approximation method yields promising results, indicating potential in specific contexts.

03

The approach enables learning and planning in stochastic games without prior domain knowledge.

Abstract

In this paper, we extend the Descent framework, which enables learning and planning in the context of two-player games with perfect information, to the framework of stochastic games. We propose two ways of doing this, the first way generalizes the search algorithm, i.e. Descent, to stochastic games and the second way approximates stochastic games by deterministic games. We then evaluate them on the game EinStein wurfelt nicht! against state-of-the-art algorithms: Expectiminimax and Polygames (i.e. the Alpha Zero algorithm). It is our generalization of Descent which obtains the best results. The approximation by deterministic games nevertheless obtains good results, presaging that it could give better results in particular contexts.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Sports Analytics and Performance