Multi-Fidelity Reinforcement Learning with Gaussian Processes

Varun Suryan; Nahush Gondhalekar; Pratap Tokekar

arXiv:1712.06489 · cs.RO · March 10, 2020 · 1 citation

Open Access

TL;DR

This paper introduces multi-fidelity reinforcement learning methods using Gaussian Processes to efficiently learn policies with fewer real-world samples, demonstrated through simulations and ground robot experiments.

Contribution

It proposes both model-based and model-free multi-fidelity RL frameworks leveraging Gaussian Processes to reduce sample complexity in real-world environments.

Findings

01 Up to 40% reduction in samples for model-based RL.

02 Up to 60% reduction in samples for model-free RL.

03 Validated through simulations and ground robot navigation experiments.

Abstract

We study the problem of Reinforcement Learning (RL) using as few real-world samples as possible. A naive application of RL can be inefficient in large and continuous state spaces. We present two versions of Multi-Fidelity Reinforcement Learning (MFRL), model-based and model-free, that leverage Gaussian Processes (GPs) to learn the optimal policy in a real-world environment. In the MFRL framework, an agent uses multiple simulators of the real environment to perform actions. With increasing fidelity in a simulator chain, the number of samples used in successively higher simulators can be reduced. By incorporating GPs in the MFRL framework, we empirically observe up to 40% reduction in the number of samples for model-based RL and 60% reduction for the model-free version. We examine the performance of our algorithms through simulations and through real-world experiments for navigation…
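To illustrate why a GP is useful in this setting, here is a minimal sketch of GP regression with a predictive-variance check. It is not the authors' implementation. The squared-exponential kernel, the noise level, the toy sine data, and the `needs_more_samples` threshold rule are all assumptions chosen for illustration. The idea shown is only that a GP returns an uncertainty estimate alongside each prediction, so an agent could tell which regions are already well covered by cheap simulator samples and which are not.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=1.0, signal_var=1.0):
    """Squared-exponential kernel between point sets a (n, d) and b (m, d)."""
    sq_dists = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return signal_var * np.exp(-0.5 * np.maximum(sq_dists, 0.0) / length_scale**2)


def gp_posterior(x_train, y_train, x_query, noise_var=1e-2):
    """Return the GP posterior mean and variance at each query point."""
    k_tt = rbf_kernel(x_train, x_train) + noise_var * np.eye(len(x_train))
    k_tq = rbf_kernel(x_train, x_query)
    chol = np.linalg.cholesky(k_tt)
    # Solve (K + noise * I) alpha = y using the Cholesky factor.
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train))
    v = np.linalg.solve(chol, k_tq)
    mean = k_tq.T @ alpha
    var = rbf_kernel(x_query, x_query).diagonal() - np.sum(v**2, axis=0)
    return mean, np.maximum(var, 0.0)


def needs_more_samples(var, threshold=0.1):
    """Hypothetical rule: flag points whose predictive variance is too high."""
    return var > threshold


# Toy data standing in for samples collected in a cheap simulator (assumption).
x_train = np.array([[0.0], [1.0], [2.0]])
y_train = np.sin(x_train).ravel()

# One query close to the collected data, one far away from it.
x_query = np.array([[1.0], [5.0]])
mean, var = gp_posterior(x_train, y_train, x_query)
flags = needs_more_samples(var)
print(flags)
```

In this toy run the query near the training data has low predictive variance and the distant query has variance close to the prior, so only the distant one is flagged. How the paper actually uses GP uncertainty to move between simulators is not specified in the abstract above.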

Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Reinforcement Learning in Robotics