A journey in ESN and LSTM visualisations on a language task
Alexandre Variengien, Xavier Hinaut

TL;DR
This paper compares ESN and LSTM architectures on a language learning task, analyzing their performance, internal dynamics, and visualizing their latent state spaces to understand their mechanisms and differences.
Contribution
It introduces a comprehensive comparison of ESN and LSTM on a language task, including a novel visualization method for internal states and insights into their shared mechanisms.
Findings
Both models successfully learned the task.
LSTM achieved lower error on simple data, ESN trained faster.
ESN outperformed LSTM on challenging datasets without tuning.
Abstract
Echo States Networks (ESN) and Long-Short Term Memory networks (LSTM) are two popular architectures of Recurrent Neural Networks (RNN) to solve machine learning task involving sequential data. However, little have been done to compare their performances and their internal mechanisms on a common task. In this work, we trained ESNs and LSTMs on a Cross-Situationnal Learning (CSL) task. This task aims at modelling how infants learn language: they create associations between words and visual stimuli in order to extract meaning from words and sentences. The results are of three kinds: performance comparison, internal dynamics analyses and visualization of latent space. (1) We found that both models were able to successfully learn the task: the LSTM reached the lowest error for the basic corpus, but the ESN was quicker to train. Furthermore, the ESN was able to outperform LSTMs on datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Neural Networks and Applications · Stock Market Forecasting Methods
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
