Analysis and Improvement of Adversarial Training in DQN Agents With   Adversarially-Guided Exploration (AGE)

Vahid Behzadan; William Hsu

arXiv:1906.01119·cs.LG·June 5, 2019·5 cites

Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)

Vahid Behzadan, William Hsu

PDF

Open Access

TL;DR

This paper analyzes the robustness of DQN agents under adversarial training, introduces a novel exploration method called AGE to improve training efficiency, and empirically evaluates its effectiveness against traditional exploration strategies.

Contribution

It provides a formal analysis of adversarial training in DQNs and proposes AGE, a new exploration mechanism that enhances training efficiency and robustness.

Findings

01

AGE outperforms traditional exploration methods in robustness.

02

Adversarial training improves DQN robustness to state perturbations.

03

AGE demonstrates better sample efficiency in experiments.

Abstract

This paper investigates the effectiveness of adversarial training in enhancing the robustness of Deep Q-Network (DQN) policies to state-space perturbations. We first present a formal analysis of adversarial training in DQN agents and its performance with respect to the proportion of adversarial perturbations to nominal observations used for training. Next, we consider the sample-inefficiency of current adversarial training techniques, and propose a novel Adversarially-Guided Exploration (AGE) mechanism based on a modified hybrid of the $ϵ$ -greedy algorithm and Boltzmann exploration. We verify the feasibility of this exploration mechanism through experimental evaluation of its performance in comparison with the traditional decaying $ϵ$ -greedy and parameter-space noise exploration algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Reinforcement Learning in Robotics

MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network