Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End   Policy and Optimistic Smooth Fictitious Play

Wei Xi; Yongxin Zhang; Changnan Xiao; Xuefeng Huang; Shihong Deng,; Haowei Liang; Jie Chen; Peng Sun

arXiv:2303.04096·cs.LG·March 8, 2023·5 cites

Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play

Wei Xi, Yongxin Zhang, Changnan Xiao, Xuefeng Huang, Shihong Deng,, Haowei Liang, Jie Chen, Peng Sun

PDF

Open Access 1 Repo

TL;DR

This paper introduces an end-to-end policy and an optimistic smooth fictitious play algorithm to master a multi-stage strategy card game, achieving state-of-the-art results and winning championships.

Contribution

It presents a novel approach combining deep reinforcement learning with fictitious play tailored for multi-stage games with varying observation and action spaces.

Findings

01

Wins double championships at COG2022

02

Outperforms existing methods on Legends of Code and Magic

03

Demonstrates effectiveness of the proposed algorithms in complex multi-stage settings

Abstract

Deep Reinforcement Learning combined with Fictitious Play shows impressive results on many benchmark games, most of which are, however, single-stage. In contrast, real-world decision making problems may consist of multiple stages, where the observation spaces and the action spaces can be completely different across stages. We study a two-stage strategy card game Legends of Code and Magic and propose an end-to-end policy to address the difficulties that arise in multi-stage game. We also propose an optimistic smooth fictitious play algorithm to find the Nash Equilibrium for the two-player game. Our approach wins double championships of COG2022 competition. Extensive studies verify and show the advancement of our approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ronaldosvieira/gym-locm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Human Pose and Action Recognition