DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays
Bo Xia, Yilun Kong, Yongzhe Chang, Bo Yuan, Zhiheng Li, Xueqian Wang,, Bin Liang

TL;DR
DEER is a framework that improves reinforcement learning in environments with variable delays by using a pretrained encoder to map delayed states into meaningful representations, enhancing interpretability and delay resilience.
Contribution
The paper introduces DEER, a novel delay-resilient framework employing a pretrained encoder trained on delay-free data to handle variable delays in RL tasks.
Findings
DEER outperforms state-of-the-art RL algorithms in delayed scenarios.
The pretrained encoder effectively maps delayed states to improve learning stability.
DEER seamlessly integrates with existing RL algorithms without modifications.
Abstract
Classic reinforcement learning (RL) frequently confronts challenges in tasks involving delays, which cause a mismatch between received observations and subsequent actions, thereby deviating from the Markov assumption. Existing methods usually tackle this issue with end-to-end solutions using state augmentation. However, these black-box approaches often involve incomprehensible processes and redundant information in the information states, causing instability and potentially undermining the overall performance. To alleviate the delay challenges in RL, we propose , a framework designed to effectively enhance the interpretability and address the random delay issues. DEER employs a pretrained encoder to map delayed states, along with their variable-length past action sequences resulting from different delays, into hidden states, which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
