To be a fast adaptive learner: using game history to defeat opponents
Guangzhao Cheng, Siliang Tang

TL;DR
This paper introduces F3, a novel framework that leverages past game history to enable AI agents to quickly adapt and outperform opponents with unknown strategies in repeated game scenarios.
Contribution
The paper proposes a new framework (F3) with an Opponent Action Estimator that effectively fuses past and current game history for rapid opponent modeling.
Findings
F3-trained agents outperform deep reinforcement learning agents in fixed turns.
F3 enables quick adaptation to unknown opponent strategies.
OAE module contains transferable meta-knowledge across different games.
Abstract
In many real-world games, such as traders repeatedly bargaining with customers, it is very hard for a single AI trader to make good deals with various customers in a few turns, since customers may adopt different strategies even the strategies they choose are quite simple. In this paper, we model this problem as fast adaptive learning in the finitely repeated games. We believe that past game history plays a vital role in such a learning procedure, and therefore we propose a novel framework (named, F3) to fuse the past and current game history with an Opponent Action Estimator (OAE) module that uses past game history to estimate the opponent's future behaviors. The experiments show that the agent trained by F3 can quickly defeat opponents who adopt unknown new strategies. The F3 trained agent obtains more rewards in a fixed number of turns than the agents that are trained by deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Evolutionary Algorithms and Applications · Reinforcement Learning in Robotics
