Adversarially Robust Decision Transformer

Xiaohang Tang; Afonso Marques; Parameswaran Kamalaruban; Ilija; Bogunovic

arXiv:2407.18414·cs.LG·November 4, 2024

Adversarially Robust Decision Transformer

Xiaohang Tang, Afonso Marques, Parameswaran Kamalaruban, Ilija, Bogunovic

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces ARDT, a decision transformer variant designed for adversarial environments, which improves robustness by conditioning on worst-case returns and achieving maximin strategies in sequential decision tasks.

Contribution

The paper proposes ARDT, a novel adversarially robust decision transformer that learns minimax returns, enhancing robustness against adversaries in offline reinforcement learning.

Findings

01

ARDT achieves higher worst-case returns than existing methods.

02

ARDT can generate maximin (Nash Equilibrium) strategies.

03

ARDT demonstrates superior robustness in large-scale and continuous adversarial environments.

Abstract

Decision Transformer (DT), as one of the representative Reinforcement Learning via Supervised Learning (RvS) methods, has achieved strong performance in offline learning tasks by leveraging the powerful Transformer architecture for sequential decision-making. However, in adversarial environments, these methods can be non-robust, since the return is dependent on the strategies of both the decision-maker and adversary. Training a probabilistic model conditioned on observed return to predict action can fail to generalize, as the trajectories that achieve a return in the dataset might have done so due to a suboptimal behavior adversary. To address this, we propose a worst-case-aware RvS algorithm, the Adversarially Robust Decision Transformer (ARDT), which learns and conditions the policy on in-sample minimax returns-to-go. ARDT aligns the target return with the worst-case return learned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xiaohangt/ardt
pytorchOfficial

Videos

Adversarially Robust Decision Transformer· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Fault Detection and Control Systems · Neural Networks and Applications

MethodsAttention Is All You Need · Adam · Label Smoothing · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Dense Connections