Recursive Least Squares Policy Control with Echo State Network

Chunyuan Zhang; Chao Liu; Qi Song; Jie Zhao

arXiv:2201.04781·cs.LG·January 14, 2022·1 cites

Recursive Least Squares Policy Control with Echo State Network

Chunyuan Zhang, Chao Liu, Qi Song, Jie Zhao

PDF

Open Access

TL;DR

This paper introduces two novel policy control algorithms using echo state networks with recursive least squares, incorporating mini-batch learning, regularization, and overestimation prevention to improve convergence in time-series tasks.

Contribution

The paper proposes ESNRLS-Q and ESNRLS-Sarsa algorithms that adapt RLS for ESN training in mini-batch mode with regularization and overestimation control, addressing prior limitations.

Findings

01

Algorithms demonstrate good convergence performance.

02

Effective reduction of overfitting and overestimation.

03

Enhanced stability in policy control tasks.

Abstract

The echo state network (ESN) is a special type of recurrent neural networks for processing the time-series dataset. However, limited by the strong correlation among sequential samples of the agent, ESN-based policy control algorithms are difficult to use the recursive least squares (RLS) algorithm to update the ESN's parameters. To solve this problem, we propose two novel policy control algorithms, ESNRLS-Q and ESNRLS-Sarsa. Firstly, to reduce the correlation of training samples, we use the leaky integrator ESN and the mini-batch learning mode. Secondly, to make RLS suitable for training ESN in mini-batch mode, we present a new mean-approximation method for updating the RLS correlation matrix. Thirdly, to prevent ESN from over-fitting, we use the L1 regularization technique. Lastly, to prevent the target state-action value from overestimation, we employ the Mellowmax method. Simulation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Neural Networks and Applications · Advanced Memory and Neural Computing

MethodsL1 Regularization