STEEL: Singularity-aware Reinforcement Learning
Xiaohong Chen, Zhengling Qi, Runzhe Wan

TL;DR
STEEL introduces a novel batch reinforcement learning algorithm that effectively handles singularities in data distributions, improving robustness and applicability in continuous state-action spaces with minimal data coverage assumptions.
Contribution
The paper proposes STEEL, the first RL algorithm capable of managing singularities in data distributions, with theoretical guarantees and practical adaptive variants.
Findings
STEEL outperforms existing methods in handling distribution singularities.
Theoretical regret bounds are established under singularity conditions.
Simulation and real experiments demonstrate superior robustness and performance.
Abstract
Batch reinforcement learning (RL) aims at leveraging pre-collected data to find an optimal policy that maximizes the expected total rewards in a dynamic environment. The existing methods require absolutely continuous assumption (e.g., there do not exist non-overlapping regions) on the distribution induced by target policies with respect to the data distribution over either the state or action or both. We propose a new batch RL algorithm that allows for singularity for both state and action spaces (e.g., existence of non-overlapping regions between offline data distribution and the distribution induced by the target policies) in the setting of an infinite-horizon Markov decision process with continuous states and actions. We call our algorithm STEEL: SingulariTy-awarE rEinforcement Learning. Our algorithm is motivated by a new error analysis on off-policy evaluation, where we use maximum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Smart Grid Energy Management · Energy, Environment, and Transportation Policies
