DEER: A Delay-Resilient Framework for Reinforcement Learning with   Variable Delays

Bo Xia; Yilun Kong; Yongzhe Chang; Bo Yuan; Zhiheng Li; Xueqian Wang,; Bin Liang

arXiv:2406.03102·cs.LG·June 6, 2024·2 cites

DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays

Bo Xia, Yilun Kong, Yongzhe Chang, Bo Yuan, Zhiheng Li, Xueqian Wang,, Bin Liang

PDF

Open Access

TL;DR

DEER is a framework that improves reinforcement learning in environments with variable delays by using a pretrained encoder to map delayed states into meaningful representations, enhancing interpretability and delay resilience.

Contribution

The paper introduces DEER, a novel delay-resilient framework employing a pretrained encoder trained on delay-free data to handle variable delays in RL tasks.

Findings

01

DEER outperforms state-of-the-art RL algorithms in delayed scenarios.

02

The pretrained encoder effectively maps delayed states to improve learning stability.

03

DEER seamlessly integrates with existing RL algorithms without modifications.

Abstract

Classic reinforcement learning (RL) frequently confronts challenges in tasks involving delays, which cause a mismatch between received observations and subsequent actions, thereby deviating from the Markov assumption. Existing methods usually tackle this issue with end-to-end solutions using state augmentation. However, these black-box approaches often involve incomprehensible processes and redundant information in the information states, causing instability and potentially undermining the overall performance. To alleviate the delay challenges in RL, we propose $DEER (Delay-resilient Encoder-Enhanced RL)$ , a framework designed to effectively enhance the interpretability and address the random delay issues. DEER employs a pretrained encoder to map delayed states, along with their variable-length past action sequences resulting from different delays, into hidden states, which is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics