Corruption-Robust Offline Reinforcement Learning with General Function Approximation
Chenlu Ye, Rui Yang, Quanquan Gu, Tong Zhang

TL;DR
This paper introduces a new offline reinforcement learning algorithm that is robust to data corruption, providing theoretical guarantees that match lower bounds in linear MDP settings.
Contribution
It proposes a corruption-robust offline RL algorithm using an uncertainty-weighting technique, with tight bounds in linear MDPs, addressing adversarial data corruption.
Findings
Achieves suboptimality bounds with additive corruption-dependent terms.
Reduces to known bounds in linear MDPs, matching lower bounds.
Provides a new method for corruption-robust offline RL with theoretical guarantees.
Abstract
We investigate the problem of corruption robustness in offline reinforcement learning (RL) with general function approximation, where an adversary can corrupt each sample in the offline dataset, and the corruption level quantifies the cumulative corruption amount over episodes and steps. Our goal is to find a policy that is robust to such corruption and minimizes the suboptimality gap with respect to the optimal policy for the uncorrupted Markov decision processes (MDPs). Drawing inspiration from the uncertainty-weighting technique from the robust online RL setting \citep{he2022nearly,ye2022corruptionrobust}, we design a new uncertainty weight iteration procedure to efficiently compute on batched samples and propose a corruption-robust algorithm for offline RL. Notably, under the assumption of single policy coverage and the knowledge of , our proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Imbalanced Data Classification Techniques · Advanced Causal Inference Techniques
