A Clockwork RNN
Jan Koutn\'ik, Klaus Greff, Faustino Gomez, J\"urgen Schmidhuber

TL;DR
The paper introduces the Clockwork RNN, a modified architecture that partitions the hidden layer into modules operating at different clock rates, enhancing long-term dependency modeling, reducing parameters, and improving performance in sequence tasks.
Contribution
It proposes a novel RNN architecture that improves training efficiency and performance by processing different parts of the network at separate temporal scales.
Findings
Outperforms standard RNNs and LSTMs in sequence tasks
Reduces number of parameters compared to standard RNNs
Speeds up network evaluation
Abstract
Sequence prediction and classification are ubiquitous and challenging problems in machine learning that can require identifying complex dependencies between temporally distant inputs. Recurrent Neural Networks (RNNs) have the ability, in theory, to cope with these temporal dependencies by virtue of the short-term memory implemented by their recurrent (feedback) connections. However, in practice they are difficult to train successfully when the long-term memory is required. This paper introduces a simple, yet powerful modification to the standard RNN architecture, the Clockwork RNN (CW-RNN), in which the hidden layer is partitioned into separate modules, each processing inputs at its own temporal granularity, making computations only at its prescribed clock rate. Rather than making the standard RNN models more complex, CW-RNN reduces the number of RNN parameters, improves the performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Time Series Analysis and Forecasting
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
