Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh, Pier Giuseppe Sessa, Yifan Hu, Andreas Krause,, Ilija Bogunovic

TL;DR
This paper introduces a model-based reinforcement learning approach that uses Gaussian Processes and distributional robustness concepts to efficiently learn policies in large state spaces, demonstrating robustness and sample efficiency.
Contribution
It proposes a novel distributionally robust model-based RL method with theoretical sample complexity bounds, applicable to large, continuous state spaces and beyond linear dynamics.
Findings
Demonstrates robustness to distributional shifts
Achieves superior sample efficiency
Provides theoretical guarantees for near-optimal policies
Abstract
Three major challenges in reinforcement learning are the complex dynamical systems with large state spaces, the costly data acquisition processes, and the deviation of real-world dynamics from the training environment deployment. To overcome these issues, we study distributionally robust Markov decision processes with continuous state spaces under the widely used Kullback-Leibler, chi-square, and total variation uncertainty sets. We propose a model-based approach that utilizes Gaussian Processes and the maximum variance reduction algorithm to efficiently learn multi-output nominal transition dynamics, leveraging access to a generative model (i.e., simulator). We further demonstrate the statistical sample complexity of the proposed method for different uncertainty sets. These complexity bounds are independent of the number of states and extend beyond linear dynamics, ensuring the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Reinforcement Learning in Robotics
