TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios

Wentao Xu; Zhongming Yao; Weihao Li; Zhenghang Song; Yumeng Song; Tianyi Li; Yushuai Li

arXiv:2602.13040·cs.LG·February 16, 2026

TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios

Wentao Xu, Zhongming Yao, Weihao Li, Zhenghang Song, Yumeng Song, Tianyi Li, Yushuai Li

PDF

Open Access

TL;DR

This paper introduces TCRL, a novel adversarial training framework that enhances the robustness of constrained reinforcement learning policies against temporally coupled perturbations in worst-case scenarios.

Contribution

TCRL is the first to explicitly model and defend against temporally coupled adversarial perturbations in constrained reinforcement learning.

Findings

01

TCRL outperforms existing methods in robustness against temporally coupled attacks.

02

TCRL maintains reward unpredictability while ensuring safety under worst-case perturbations.

03

Experimental results validate TCRL's effectiveness across various CRL tasks.

Abstract

Constrained Reinforcement Learning (CRL) aims to optimize decision-making policies under constraint conditions, making it highly applicable to safety-critical domains such as autonomous driving, robotics, and power grid management. However, existing robust CRL approaches predominantly focus on single-step perturbations and temporally independent adversarial models, lacking explicit modeling of robustness against temporally coupled perturbations. To tackle these challenges, we propose TCRL, a novel temporal-coupled adversarial training framework for robust constrained reinforcement learning (TCRL) in worst-case scenarios. First, TCRL introduces a worst-case-perceived cost constraint function that estimates safety costs under temporally coupled perturbations without the need to explicitly model adversarial attackers. Second, TCRL establishes a dual-constraint defense mechanism on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Smart Grid Security and Resilience · Reinforcement Learning in Robotics