Sparsity-Driven Plasticity in Multi-Task Reinforcement Learning
Aleksandar Todorov, Juan Cardenas-Cartagena, Rafael F. Cunha, Marco Zullich, Matthia Sabatelli

TL;DR
This paper investigates how sparsification techniques like GMP and SET can mitigate plasticity loss in multi-task reinforcement learning, leading to improved adaptability and performance across various architectures and benchmarks.
Contribution
It systematically demonstrates that sparsification methods enhance plasticity and multi-task performance, providing new insights into their role in MTRL systems.
Findings
GMP and SET reduce neuron dormancy and representational collapse.
Sparse agents often outperform dense baselines in MTRL tasks.
Sparsification improves adaptability and performance in diverse MTRL architectures.
Abstract
Plasticity loss, a diminishing capacity to adapt as training progresses, is a critical challenge in deep reinforcement learning. We examine this issue in multi-task reinforcement learning (MTRL), where higher representational flexibility is crucial for managing diverse and potentially conflicting task demands. We systematically explore how sparsification methods, particularly Gradual Magnitude Pruning (GMP) and Sparse Evolutionary Training (SET), enhance plasticity and consequently improve performance in MTRL agents. We evaluate these approaches across distinct MTRL architectures (shared backbone, Mixture of Experts, Mixture of Orthogonal Experts) on standardized MTRL benchmarks, comparing against dense baselines, and a comprehensive range of alternative plasticity-inducing or regularization methods. Our results demonstrate that both GMP and SET effectively mitigate key indicators of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Neural Networks and Reservoir Computing
