Search in Imperfect Information Games

Martin Schmid

arXiv:2111.05884·cs.AI·November 12, 2021·1 cites

Search in Imperfect Information Games

Martin Schmid

PDF

Open Access

TL;DR

This paper explores the development of sound search algorithms tailored for imperfect information games, extending traditional perfect information search methods to more realistic, uncertain environments.

Contribution

It introduces novel sound search techniques specifically designed for imperfect information games, addressing a significant gap in existing game AI research.

Findings

01

Demonstrates the effectiveness of the proposed search methods in complex imperfect information scenarios.

02

Provides theoretical foundations ensuring soundness of search algorithms in uncertain environments.

03

Shows improved decision-making performance over previous approaches in benchmark tests.

Abstract

From the very dawn of the field, search with value functions was a fundamental concept of computer games research. Turing's chess algorithm from 1950 was able to think two moves ahead, and Shannon's work on chess from $1950$ includes an extensive section on evaluation functions to be used within a search. Samuel's checkers program from 1959 already combines search and value functions that are learned through self-play and bootstrapping. TD-Gammon improves upon those ideas and uses neural networks to learn those complex value functions -- only to be again used within search. The combination of decision-time search and value functions has been present in the remarkable milestones where computers bested their human counterparts in long standing challenging games -- DeepBlue for Chess and AlphaGo for Go. Until recently, this powerful framework of search aided with (learned) value functions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Sports Analytics and Performance

MethodsDense Connections · Feedforward Network · Accumulating Eligibility Trace · TD Lambda · TD-Gammon