Modular Networks Prevent Catastrophic Interference in Model-Based Multi-Task Reinforcement Learning

Robin Schiewer; Laurenz Wiskott

arXiv:2111.08010·cs.LG·November 17, 2021

TL;DR

This paper investigates how modular network structures can prevent catastrophic interference in model-based multi-task reinforcement learning, showing that isolated sub-networks improve performance over shared models.

Contribution

It demonstrates that modular networks with isolated sub-networks mitigate task confusion and enhance multi-task learning in model-based reinforcement learning.

Findings

01 Shared dynamics models cause task confusion and performance drops.

02 Modular networks with isolated sub-networks improve multi-task learning.

03 Results are validated on gridworld and VizDoom environments.

Abstract

In a multi-task reinforcement learning setting, the learner commonly benefits from training on multiple related tasks by exploiting similarities among them. At the same time, the trained agent is able to solve a wider range of different problems. While this effect is well documented for model-free multi-task methods, we demonstrate a detrimental effect when using a single learned dynamics model for multiple tasks. Thus, we address the fundamental question of whether model-based multi-task reinforcement learning benefits from shared dynamics models in a similar way model-free methods do from shared policy networks. Using a single dynamics model, we see clear evidence of task confusion and reduced performance. As a remedy, enforcing an internal structure for the learned dynamics model by training isolated sub-networks for each task notably improves performance while using the same amount…
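The remedy described in the abstract, isolated sub-networks for each task inside one dynamics model, can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `ModularDynamicsModel`, the linear sub-networks, the plain gradient step, and the assumption that the task identity is given as an index are all hypothetical choices made for illustration. The sketch only shows the isolation property: updating one task's sub-network cannot change another task's predictions.

```python
import numpy as np


class ModularDynamicsModel:
    """Hypothetical sketch: one isolated linear sub-network per task.

    No parameters are shared between tasks, so a gradient step for one
    task cannot interfere with the predictions for another task.
    """

    def __init__(self, n_tasks, state_dim, action_dim, seed=0):
        rng = np.random.default_rng(seed)
        in_dim = state_dim + action_dim
        # weights[t] maps a concatenated (state, action) vector to the
        # predicted next state for task t (illustrative layout).
        self.weights = [
            rng.normal(scale=0.1, size=(in_dim, state_dim))
            for _ in range(n_tasks)
        ]

    def predict(self, task_id, state, action):
        x = np.concatenate([state, action])
        return x @ self.weights[task_id]

    def update(self, task_id, state, action, next_state, lr=0.1):
        x = np.concatenate([state, action])
        error = x @ self.weights[task_id] - next_state
        # Gradient of 0.5 * ||error||^2, applied to the selected
        # sub-network only; all other sub-networks stay untouched.
        self.weights[task_id] -= lr * np.outer(x, error)
        return float(0.5 * error @ error)


# Two toy tasks with conflicting dynamics (assumed for illustration):
# task 0: next_state = state + action, task 1: next_state = state - action.
model = ModularDynamicsModel(n_tasks=2, state_dim=2, action_dim=2)
state = np.array([1.0, 0.0])
action = np.array([0.0, 1.0])

task1_before = model.predict(1, state, action).copy()
for _ in range(50):
    model.update(0, state, action, state + action)
task1_after = model.predict(1, state, action)

# Training on task 0 left the task 1 sub-network unchanged.
print(np.allclose(task1_before, task1_after))
```

A single shared weight matrix trained on both toy tasks would receive opposing gradients for the same input, which is a simple analogue of the task confusion the abstract reports for a single dynamics model.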

Code & Models

Repositories

rschiewer/lrdm
tf · Official

Taxonomy

Topics: Reinforcement Learning in Robotics · Neural Networks and Reservoir Computing · Age of Information Optimization