Distributionally Robust Model-based Reinforcement Learning with Large   State Spaces

Shyam Sundhar Ramesh; Pier Giuseppe Sessa; Yifan Hu; Andreas Krause,; Ilija Bogunovic

arXiv:2309.02236·cs.LG·June 4, 2024·1 cites

Distributionally Robust Model-based Reinforcement Learning with Large State Spaces

Shyam Sundhar Ramesh, Pier Giuseppe Sessa, Yifan Hu, Andreas Krause,, Ilija Bogunovic

PDF

Open Access

TL;DR

This paper introduces a model-based reinforcement learning approach that uses Gaussian Processes and distributional robustness concepts to efficiently learn policies in large state spaces, demonstrating robustness and sample efficiency.

Contribution

It proposes a novel distributionally robust model-based RL method with theoretical sample complexity bounds, applicable to large, continuous state spaces and beyond linear dynamics.

Findings

01

Demonstrates robustness to distributional shifts

02

Achieves superior sample efficiency

03

Provides theoretical guarantees for near-optimal policies

Abstract

Three major challenges in reinforcement learning are the complex dynamical systems with large state spaces, the costly data acquisition processes, and the deviation of real-world dynamics from the training environment deployment. To overcome these issues, we study distributionally robust Markov decision processes with continuous state spaces under the widely used Kullback-Leibler, chi-square, and total variation uncertainty sets. We propose a model-based approach that utilizes Gaussian Processes and the maximum variance reduction algorithm to efficiently learn multi-output nominal transition dynamics, leveraging access to a generative model (i.e., simulator). We further demonstrate the statistical sample complexity of the proposed method for different uncertainty sets. These complexity bounds are independent of the number of states and extend beyond linear dynamics, ensuring the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Reinforcement Learning in Robotics