General policy mapping: online continual reinforcement learning inspired   on the insect brain

Angel Yanguas-Gil; Sandeep Madireddy

arXiv:2211.16759·cs.LG·December 1, 2022

General policy mapping: online continual reinforcement learning inspired on the insect brain

Angel Yanguas-Gil, Sandeep Madireddy

PDF

Open Access 1 Repo

TL;DR

This paper introduces a biologically inspired model for online continual reinforcement learning that leverages shared policy layers and offline feature training, enabling positive backward transfer and efficient learning in resource-limited settings.

Contribution

It proposes a novel RL model inspired by insect brains that combines offline feature extraction with shared policy layers for improved online learning.

Findings

01

Positive backward transfer observed across tasks

02

Biologically inspired network restrictions are crucial for convergence

03

Model enables efficient online RL in resource-constrained environments

Abstract

We have developed a model for online continual or lifelong reinforcement learning (RL) inspired on the insect brain. Our model leverages the offline training of a feature extraction and a common general policy layer to enable the convergence of RL algorithms in online settings. Sharing a common policy layer across tasks leads to positive backward transfer, where the agent continuously improved in older tasks sharing the same underlying general policy. Biologically inspired restrictions to the agent's network are key for the convergence of RL algorithms. This provides a pathway towards efficient online RL in resource-constrained scenarios.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anglyan/roundworld
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics