Reinforcement Learning with Convolutional Reservoir Computing

Hanten Chang; Katsuya Futagami

arXiv:1912.04161·cs.NE·December 10, 2019·1 cites

Reinforcement Learning with Convolutional Reservoir Computing

Hanten Chang, Katsuya Futagami

PDF

Open Access

TL;DR

This paper introduces RCRC, a reinforcement learning approach that uses fixed random-weight CNNs and reservoir computing to efficiently extract features and decide actions without extensive training or data storage.

Contribution

The study presents a novel RCRC model that eliminates the need for training feature extractors and storing data, simplifying reinforcement learning processes.

Findings

01

RCRC can solve multiple tasks with a single feature extractor.

02

The model requires only one task-dependent weight to be trained.

03

It achieves high performance without training CNNs or storing data.

Abstract

Recently, reinforcement learning models have achieved great success, mastering complex tasks such as Go and other games with higher scores than human players. Many of these models store considerable data on the tasks and achieve high performance by extracting visual and time-series features using convolutional neural networks (CNNs) and recurrent neural networks, respectively. However, these networks have very high computational costs because they need to be trained by repeatedly using the stored data. In this study, we propose a novel practical approach called reinforcement learning with convolutional reservoir computing (RCRC) model. The RCRC model uses a fixed random-weight CNN and a reservoir computing model to extract visual and time-series features. Using these extracted features, it decides actions with an evolution strategy method. Thereby, the RCRC model has several desirable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Advanced Memory and Neural Computing · Neural dynamics and brain function