Reducing Exploitability with Population Based Training
Pavel Czempin, Adam Gleave

TL;DR
This paper proposes using population-based training to improve the robustness of reinforcement learning policies against diverse adversarial opponents, addressing limitations of prior adversarial training methods.
Contribution
The paper introduces a population-based training approach to enhance policy robustness by exposing agents to diverse opponents during training.
Findings
Increased robustness against adversaries with more opponent diversity.
Robustness correlates positively with the size of the opponent population.
Defense effectiveness demonstrated in two low-dimensional environments.
Abstract
Self-play reinforcement learning has achieved state-of-the-art, and often superhuman, performance in a variety of zero-sum games. Yet prior work has found that policies that are highly capable against regular opponents can fail catastrophically against adversarial policies: an opponent trained explicitly against the victim. Prior defenses using adversarial training were able to make the victim robust to a specific adversary, but the victim remained vulnerable to new ones. We conjecture this limitation was due to insufficient diversity of adversaries seen during training. We analyze a defense using population based training to pit the victim against a diverse set of opponents. We evaluate this defense's robustness against new adversaries in two low-dimensional environments. This defense increases robustness against adversaries, as measured by the number of attacker training timesteps to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Reinforcement Learning in Robotics
MethodsPopulation Based Training
