A Game-Theoretic Perspective of Generalization in Reinforcement Learning
Chang Yang, Ruiyu Wang, Xinrun Wang, Zhen Wang

TL;DR
This paper introduces a game-theoretic framework called GiRL for analyzing and improving generalization in reinforcement learning by training agents against adversaries manipulating task distributions, unifying various schemes.
Contribution
It proposes a unified game-theoretic framework for RL generalization and adapts PSRO with novel modifications for effective training and testing strategies.
Findings
Outperforms existing baselines like MAML in MuJoCo environments.
Unifies multiple RL generalization schemes under a single framework.
Demonstrates the effectiveness of the proposed methods through extensive experiments.
Abstract
Generalization in reinforcement learning (RL) is of importance for real deployment of RL algorithms. Various schemes are proposed to address the generalization issues, including transfer learning, multi-task learning and meta learning, as well as the robust and adversarial reinforcement learning. However, there is not a unified formulation of the various schemes, as well as the comprehensive comparisons of methods across different schemes. In this work, we propose a game-theoretic framework for the generalization in reinforcement learning, named GiRL, where an RL agent is trained against an adversary over a set of tasks, where the adversary can manipulate the distributions over tasks within a given threshold. With different configurations, GiRL can reduce the various schemes mentioned above. To solve GiRL, we adapt the widely-used method in game theory, policy space response oracle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Machine Learning and Data Classification
MethodsModel-Agnostic Meta-Learning
