Evolving Reservoirs for Meta Reinforcement Learning
Corentin L\'eger, Gautier Hamon, Eleni Nisioti, Xavier Hinaut, and Cl\'ement Moulin-Frier

TL;DR
This paper introduces an evolutionary approach to optimize reservoir architectures in meta reinforcement learning, enhancing an agent's ability to adapt to complex, partially observable, and novel tasks across diverse environments.
Contribution
It proposes evolving reservoir hyperparameters instead of weights to improve meta RL performance, enabling better adaptation and generalization in complex tasks.
Findings
Evolved reservoirs improve learning in challenging tasks
Reservoir architecture aids in solving partially observable environments
Enhanced generalization to unseen tasks
Abstract
Animals often demonstrate a remarkable ability to adapt to their environments during their lifetime. They do so partly due to the evolution of morphological and neural structures. These structures capture features of environments shared between generations to bias and speed up lifetime learning. In this work, we propose a computational model for studying a mechanism that can enable such a process. We adopt a computational framework based on meta reinforcement learning as a model of the interplay between evolution and development. At the evolutionary scale, we evolve reservoirs, a family of recurrent neural networks that differ from conventional networks in that one optimizes not the synaptic weights, but hyperparameters controlling macro-level properties of the resulting network architecture. At the developmental scale, we employ these evolved reservoirs to facilitate the learning of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Advanced Memory and Neural Computing · Neural dynamics and brain function
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
