Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization

Juan Cruz Barsce; Jorge A. Palombarini; Ernesto C. Martínez

arXiv:1805.04748·cs.AI·May 15, 2018

TL;DR

This paper proposes an autonomous reinforcement learning framework that uses Bayesian optimization and bandit strategies to automatically tune hyper-parameters, improving learning efficiency in uncertain environments.

Contribution

It introduces a novel integration of Bayesian optimization with Gaussian processes and bandit methods for automatic hyper-parameter tuning in reinforcement learning.
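As a rough illustration of the idea, the sketch below runs Bayesian optimization with a Gaussian process surrogate over a single hyper-parameter in [0, 1]. It is not the authors' implementation: the RBF kernel, its length scale, the expected-improvement acquisition, the candidate grid, and every function name (`rbf`, `solve`, `gp_posterior`, `expected_improvement`, `bayes_opt`) are assumptions made for this example. The toy `objective` stands in for a measure of the agent's learning performance.

```python
import math

def rbf(a, b, length=0.2):
    """Squared-exponential kernel (assumed here; the length scale is arbitrary)."""
    return math.exp(-0.5 * ((a - b) / length) ** 2)

def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x

def gp_posterior(xs, ys, x_new, noise=1e-4):
    """Posterior mean and standard deviation of a zero-mean GP at x_new."""
    k = [[rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(xs)]
         for i, a in enumerate(xs)]
    k_star = [rbf(a, x_new) for a in xs]
    mean = sum(ks * w for ks, w in zip(k_star, solve(k, ys)))
    var = rbf(x_new, x_new) - sum(ks * v for ks, v in zip(k_star, solve(k, k_star)))
    return mean, math.sqrt(max(var, 1e-12))  # clamp small negative round-off

def expected_improvement(mean, std, best):
    """Expected improvement over the best observed value (maximization)."""
    z = (mean - best) / std
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2 * math.pi)
    cdf = 0.5 * (1 + math.erf(z / math.sqrt(2)))
    return (mean - best) * cdf + std * pdf

def bayes_opt(objective, n_iter=10, grid_size=101):
    """Maximize objective over [0, 1], proposing points from a fixed grid."""
    grid = [i / (grid_size - 1) for i in range(grid_size)]
    xs = [0.0, 0.5, 1.0]  # initial design, chosen arbitrarily
    ys = [objective(x) for x in xs]
    for _ in range(n_iter):
        best = max(ys)
        candidates = [x for x in grid if x not in xs]
        x_next = max(
            candidates,
            key=lambda x: expected_improvement(*gp_posterior(xs, ys, x), best),
        )
        xs.append(x_next)
        ys.append(objective(x_next))
    i = max(range(len(ys)), key=ys.__getitem__)
    return xs[i], ys[i]

if __name__ == "__main__":
    # Toy stand-in for "performance as a function of one hyper-parameter".
    print(bayes_opt(lambda x: -(x - 0.3) ** 2))
```

In a hyper-parameter tuning setting, each call to `objective` would be an expensive training run, which is why a surrogate model is used to decide where to evaluate next instead of sweeping the whole grid.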

Findings

1. Hyper-parameter optimization improves SARSA performance.

2. The framework reduces the need for manual tuning.

3. Demonstrated effectiveness on a gridworld example.
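To make concrete which hyper-parameters are being tuned, here is a minimal tabular SARSA sketch on a small gridworld. The grid layout, the reward of -1 per step, the epsilon-greedy policy, and the score (mean steps to the goal over the final episodes) are assumptions for illustration and are not taken from the paper's experiments; `run_sarsa` is a hypothetical name.

```python
import random

def run_sarsa(alpha, epsilon, gamma=0.95, episodes=200, size=4,
              max_steps=100, seed=0):
    """Tabular SARSA on a hypothetical size x size gridworld.

    Returns the mean number of steps per episode over the last 50
    episodes (lower is better), a score a tuner could optimize.
    """
    rng = random.Random(seed)
    moves = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
    goal = (size - 1, size - 1)
    q = {}  # (state, action) -> value, missing entries count as 0.0

    def choose(state):
        # Epsilon-greedy: explore with probability epsilon, else act greedily.
        if rng.random() < epsilon:
            return rng.randrange(4)
        values = [q.get((state, a), 0.0) for a in range(4)]
        best = max(values)
        return rng.choice([a for a, v in enumerate(values) if v == best])

    steps_per_episode = []
    for _ in range(episodes):
        state = (0, 0)
        action = choose(state)
        steps = 0
        while state != goal and steps < max_steps:
            dr, dc = moves[action]
            nxt = (min(max(state[0] + dr, 0), size - 1),
                   min(max(state[1] + dc, 0), size - 1))
            reward = 0.0 if nxt == goal else -1.0
            nxt_action = choose(nxt)
            # On-policy target: uses the action actually selected in nxt.
            future = 0.0 if nxt == goal else gamma * q.get((nxt, nxt_action), 0.0)
            old = q.get((state, action), 0.0)
            q[(state, action)] = old + alpha * (reward + future - old)
            state, action = nxt, nxt_action
            steps += 1
        steps_per_episode.append(steps)
    tail = steps_per_episode[-50:]
    return sum(tail) / len(tail)

if __name__ == "__main__":
    for alpha, epsilon in [(0.5, 0.1), (0.05, 0.5)]:
        print(alpha, epsilon, run_sarsa(alpha, epsilon))
```

The learning rate `alpha` and exploration rate `epsilon` are the kind of settings that would otherwise be hand-tuned; a hyper-parameter optimizer would treat `run_sarsa` as a black-box function of them.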

Abstract

With machine learning increasingly used by industry and scientific communities for tasks such as text mining, image recognition and self-driving cars, automatic setting of hyper-parameters in learning algorithms is a key factor for achieving satisfactory performance regardless of the user's expertise in the inner workings of the techniques and methodologies. In particular, for a reinforcement learning algorithm, the efficiency of an agent learning a control policy in an uncertain environment depends heavily on the hyper-parameters used to balance exploration with exploitation. In this work, an autonomous learning framework is proposed that integrates Bayesian optimization with Gaussian process regression to optimize the hyper-parameters of a reinforcement learning algorithm. Also, a bandits-based approach to achieve a balance between computational costs and decreasing…

Taxonomy

Methods: Gaussian Process