REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using   Reinforcement Learning Agents

Aristotelis Lazaridis; Ioannis Vlahavas

arXiv:2110.05128·cs.LG·May 12, 2022

REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using Reinforcement Learning Agents

Aristotelis Lazaridis, Ioannis Vlahavas

PDF

TL;DR

REIN-2 introduces a meta-learning framework within reinforcement learning that trains a meta-learner to generate specialized RL agents, significantly improving sample efficiency and stability in complex environments.

Contribution

The paper presents REIN-2, a novel meta-learning scheme that enables RL agents to learn how to produce effective inner-learners, enhancing performance and sample efficiency over traditional Deep RL methods.

Findings

01

REIN-2 outperforms traditional Deep RL algorithms in OpenAI Gym environments.

02

REIN-2 demonstrates improved sample efficiency, especially in complex tasks like Mountain Car.

03

Experimental results show increased stability and scoring accuracy.

Abstract

Deep Reinforcement Learning (Deep RL) has been in the spotlight for the past few years, due to its remarkable abilities to solve problems which were considered to be practically unsolvable using traditional Machine Learning methods. However, even state-of-the-art Deep RL algorithms have various weaknesses that prevent them from being used extensively within industry applications, with one such major weakness being their sample-inefficiency. In an effort to patch these issues, we integrated a meta-learning technique in order to shift the objective of learning to solve a task into the objective of learning how to learn to solve a task (or a set of tasks), which we empirically show that improves overall stability and performance of Deep RL algorithms. Our model, named REIN-2, is a meta-learning scheme formulated within the RL framework, the goal of which is to develop a meta-RL agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.