Modulating Reservoir Dynamics via Reinforcement Learning for Efficient Robot Skill Synthesis
Zahra Koulaeizadeh, Erhan Oztop

TL;DR
This paper introduces a reservoir computing framework combined with reinforcement learning to enable efficient robot skill learning and online modulation, allowing robots to generate diverse movements beyond initial demonstrations.
Contribution
The work presents a novel RC-based Learning from Demonstration method that incorporates RL for dynamic context modulation, enabling out-of-distribution movement generation without retraining the reservoir.
Findings
Effective generation of reaching movements with obstacle avoidance.
RL-driven context modulation extends action repertoire.
Efficient learning with low-dimensional context and no iterative training.
Abstract
A random recurrent neural network, called a reservoir, can be used to learn robot movements conditioned on context inputs that encode task goals. The Learning is achieved by mapping the random dynamics of the reservoir modulated by context to desired trajectories via linear regression. This makes the reservoir computing (RC) approach computationally efficient as no iterative gradient descent learning is needed. In this work, we propose a novel RC-based Learning from Demonstration (LfD) framework that not only learns to generate the demonstrated movements but also allows online modulation of the reservoir dynamics to generate movement trajectories that are not covered by the initial demonstration set. This is made possible by using a Reinforcement Learning (RL) module that learns a policy to output context as its actions based on the robot state. Considering that the context dimension is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Control Systems Optimization · Robot Manipulation and Learning
MethodsSparse Evolutionary Training
