Adversarially Robust Decision Transformer
Xiaohang Tang, Afonso Marques, Parameswaran Kamalaruban, Ilija, Bogunovic

TL;DR
This paper introduces ARDT, a decision transformer variant designed for adversarial environments, which improves robustness by conditioning on worst-case returns and achieving maximin strategies in sequential decision tasks.
Contribution
The paper proposes ARDT, a novel adversarially robust decision transformer that learns minimax returns, enhancing robustness against adversaries in offline reinforcement learning.
Findings
ARDT achieves higher worst-case returns than existing methods.
ARDT can generate maximin (Nash Equilibrium) strategies.
ARDT demonstrates superior robustness in large-scale and continuous adversarial environments.
Abstract
Decision Transformer (DT), as one of the representative Reinforcement Learning via Supervised Learning (RvS) methods, has achieved strong performance in offline learning tasks by leveraging the powerful Transformer architecture for sequential decision-making. However, in adversarial environments, these methods can be non-robust, since the return is dependent on the strategies of both the decision-maker and adversary. Training a probabilistic model conditioned on observed return to predict action can fail to generalize, as the trajectories that achieve a return in the dataset might have done so due to a suboptimal behavior adversary. To address this, we propose a worst-case-aware RvS algorithm, the Adversarially Robust Decision Transformer (ARDT), which learns and conditions the policy on in-sample minimax returns-to-go. ARDT aligns the target return with the worst-case return learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Fault Detection and Control Systems · Neural Networks and Applications
MethodsAttention Is All You Need · Adam · Label Smoothing · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Dense Connections
