Anticipating Oblivious Opponents in Stochastic Games

Shadi Tasdighi Kalat; Sriram Sankaranarayanan; Ashutosh Trivedi

arXiv:2409.11671·cs.AI·September 19, 2024

Anticipating Oblivious Opponents in Stochastic Games

Shadi Tasdighi Kalat, Sriram Sankaranarayanan, Ashutosh Trivedi

PDF

Open Access

TL;DR

This paper introduces a method to predict the actions of oblivious environments in stochastic games by synthesizing an information state machine, enabling optimal policy computation to maximize rewards in various scenarios.

Contribution

It presents a novel approach for systematically anticipating oblivious environment policies using an information state machine with consistency guarantees.

Findings

01

Successfully anticipates environment policies in benchmark tasks

02

Maximizes reward in human activity scenarios

03

Provides a method for checking automaton consistency

Abstract

We present an approach for systematically anticipating the actions and policies employed by \emph{oblivious} environments in concurrent stochastic games, while maximizing a reward function. Our main contribution lies in the synthesis of a finite \emph{information state machine} whose alphabet ranges over the actions of the environment. Each state of the automaton is mapped to a belief state about the policy used by the environment. We introduce a notion of consistency that guarantees that the belief states tracked by our automaton stays within a fixed distance of the precise belief state obtained by knowledge of the full history. We provide methods for checking consistency of an automaton and a synthesis approach which upon successful termination yields such a machine. We show how the information state machine yields an MDP that serves as the starting point for computing optimal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications