A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning

Zhengpeng Xie; Yulong Zhang

arXiv:2501.17384·cs.LG·October 9, 2025

A Dual-Agent Adversarial Framework for Robust Generalization in Deep Reinforcement Learning

Zhengpeng Xie, Yulong Zhang

PDF

Open Access

TL;DR

This paper introduces a dual-agent adversarial framework in deep reinforcement learning that enhances generalization by encouraging agents to learn robust policies against irrelevant high-dimensional features, demonstrated on Procgen benchmarks.

Contribution

The paper proposes a novel dual-agent adversarial learning framework that improves RL generalization without human priors, applicable across various algorithms like PPO.

Findings

01

Significant performance improvements on Procgen benchmark

02

Enhanced robustness against irrelevant features in high-dimensional observations

03

Applicable to multiple RL algorithms, notably PPO

Abstract

Recently, empowered with the powerful capabilities of neural networks, reinforcement learning (RL) has successfully tackled numerous challenging tasks. However, while these models demonstrate enhanced decision-making abilities, they are increasingly prone to overfitting. For instance, a trained RL model often fails to generalize to even minor variations of the same task, such as a change in background color or other minor semantic differences. To address this issue, we propose a dual-agent adversarial policy learning framework, which allows agents to spontaneously learn the underlying semantics without introducing any human prior knowledge. Specifically, our framework involves a game process between two agents: each agent seeks to maximize the impact of perturbing on the opponent's policy by producing representation differences for the same state, while maintaining its own stability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Fault Detection and Control Systems