Mixture of Experts in a Mixture of RL settings

Timon Willi; Johan Obando-Ceron; Jakob Foerster; Karolina Dziugaite,; Pablo Samuel Castro

arXiv:2406.18420·cs.LG·June 27, 2024

Mixture of Experts in a Mixture of RL settings

Timon Willi, Johan Obando-Ceron, Jakob Foerster, Karolina Dziugaite,, Pablo Samuel Castro

PDF

Open Access

TL;DR

This paper investigates how Mixture of Experts (MoEs) can improve Deep Reinforcement Learning (DRL) in highly non-stationary, multi-task environments, providing new insights into their mechanisms and optimal integration.

Contribution

It offers a detailed analysis of MoEs' ability to handle non-stationarity in DRL, especially in multi-task settings, and explores how different MoE components influence learning.

Findings

01

MoEs enhance DRL performance in non-stationary, multi-task environments.

02

Multi-task experiments reveal the mechanisms behind MoEs' benefits.

03

Optimal MoE component integration improves actor-critic DRL networks.

Abstract

Mixtures of Experts (MoEs) have gained prominence in (self-)supervised learning due to their enhanced inference efficiency, adaptability to distributed training, and modularity. Previous research has illustrated that MoEs can significantly boost Deep Reinforcement Learning (DRL) performance by expanding the network's parameter count while reducing dormant neurons, thereby enhancing the model's learning capacity and ability to deal with non-stationarity. In this work, we shed more light on MoEs' ability to deal with non-stationarity and investigate MoEs in DRL settings with "amplified" non-stationarity via multi-task training, providing further evidence that MoEs improve learning capacity. In contrast to previous work, our multi-task results allow us to better understand the underlying causes for the beneficial effect of MoE in DRL training, the impact of the various MoE components, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Expert finding and Q&A systems · Distributed Sensor Networks and Detection Algorithms

MethodsMixture of Experts