ReRoGCRL: Representation-based Robustness in Goal-Conditioned   Reinforcement Learning

Xiangyu Yin; Sihao Wu; Jiaxu Liu; Meng Fang; Xingyu Zhao; Xiaowei; Huang; Wenjie Ruan

arXiv:2312.07392·cs.LG·December 21, 2023·2 cites

ReRoGCRL: Representation-based Robustness in Goal-Conditioned Reinforcement Learning

Xiangyu Yin, Sihao Wu, Jiaxu Liu, Meng Fang, Xingyu Zhao, Xiaowei, Huang, Wenjie Ruan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel attack and defense framework for Goal-Conditioned Reinforcement Learning, enhancing robustness against adversarial perturbations through semi-contrastive attacks and regularization techniques, validated across multiple algorithms.

Contribution

It proposes the Semi-Contrastive Representation attack and Adversarial Representation Tactics, novel methods to evaluate and improve GCRL robustness against adversarial attacks.

Findings

01

The attack effectively compromises GCRL policies during deployment.

02

The defense significantly improves robustness against various perturbations.

03

Experimental results outperform existing methods in robustness metrics.

Abstract

While Goal-Conditioned Reinforcement Learning (GCRL) has gained attention, its algorithmic robustness against adversarial perturbations remains unexplored. The attacks and robust representation training methods that are designed for traditional RL become less effective when applied to GCRL. To address this challenge, we first propose the Semi-Contrastive Representation attack, a novel approach inspired by the adversarial contrastive attack. Unlike existing attacks in RL, it only necessitates information from the policy function and can be seamlessly implemented during deployment. Then, to mitigate the vulnerability of existing GCRL algorithms, we introduce Adversarial Representation Tactics, which combines Semi-Contrastive Adversarial Augmentation with Sensitivity-Aware Regularizer to improve the adversarial robustness of the underlying RL agent against various types of perturbations.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

trustai/rerogcrl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning