Imitating Opponent to Win: Adversarial Policy Imitation Learning in   Two-player Competitive Games

The Viet Bui; Tien Mai; Thanh H. Nguyen

arXiv:2210.16915·cs.LG·November 1, 2022·1 cites

Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive Games

The Viet Bui, Tien Mai, Thanh H. Nguyen

PDF

Open Access

TL;DR

This paper introduces a novel adversarial policy learning method that uses imitation of the victim's policy to improve effectiveness in two-player competitive games, outperforming existing algorithms.

Contribution

The work proposes a new adversarial policy learning algorithm that incorporates victim policy imitation and feedback, enhancing generalization and effectiveness in multi-agent environments.

Findings

01

Outperforms state-of-the-art algorithms in MuJoCo games

02

Leverages imitation learning to better capture victim policy characteristics

03

Provides theoretical guarantees for imitation accuracy

Abstract

Recent research on vulnerabilities of deep reinforcement learning (RL) has shown that adversarial policies adopted by an adversary agent can influence a target RL agent (victim agent) to perform poorly in a multi-agent environment. In existing studies, adversarial policies are directly trained based on experiences of interacting with the victim agent. There is a key shortcoming of this approach; knowledge derived from historical interactions may not be properly generalized to unexplored policy regions of the victim agent, making the trained adversarial policy significantly less effective. In this work, we design a new effective adversarial policy learning algorithm that overcomes this shortcoming. The core idea of our new algorithm is to create a new imitator to imitate the victim agent's policy while the adversarial policy will be trained not only based on interactions with the victim…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning