REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using Reinforcement Learning Agents
Aristotelis Lazaridis, Ioannis Vlahavas

TL;DR
REIN-2 introduces a meta-learning framework within reinforcement learning that trains a meta-learner to generate specialized RL agents, significantly improving sample efficiency and stability in complex environments.
Contribution
The paper presents REIN-2, a novel meta-learning scheme that enables RL agents to learn how to produce effective inner-learners, enhancing performance and sample efficiency over traditional Deep RL methods.
Findings
REIN-2 outperforms traditional Deep RL algorithms in OpenAI Gym environments.
REIN-2 demonstrates improved sample efficiency, especially in complex tasks like Mountain Car.
Experimental results show increased stability and scoring accuracy.
Abstract
Deep Reinforcement Learning (Deep RL) has been in the spotlight for the past few years, due to its remarkable abilities to solve problems which were considered to be practically unsolvable using traditional Machine Learning methods. However, even state-of-the-art Deep RL algorithms have various weaknesses that prevent them from being used extensively within industry applications, with one such major weakness being their sample-inefficiency. In an effort to patch these issues, we integrated a meta-learning technique in order to shift the objective of learning to solve a task into the objective of learning how to learn to solve a task (or a set of tasks), which we empirically show that improves overall stability and performance of Deep RL algorithms. Our model, named REIN-2, is a meta-learning scheme formulated within the RL framework, the goal of which is to develop a meta-RL agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
