Going Wider: Recurrent Neural Network With Parallel Cells
Danhao Zhu, Si Shen, Xin-Yu Dai, Jiajun Chen

TL;DR
This paper introduces parallel cells in RNNs, running multiple small cells per layer to improve sequence modeling, demonstrating enhanced performance on language modeling and translation tasks.
Contribution
It proposes a novel parallel cells technique for RNNs, which improves learning ability and achieves better results than existing models.
Findings
Reduced perplexity on PTB from 78.6 to 75.3
Increased BLEU score by 0.39 points in translation
Enhanced RNN performance on sequence tasks
Abstract
Recurrent Neural Network (RNN) has been widely applied for sequence modeling. In RNN, the hidden states at current step are full connected to those at previous step, thus the influence from less related features at previous step may potentially decrease model's learning ability. We propose a simple technique called parallel cells (PCs) to enhance the learning ability of Recurrent Neural Network (RNN). In each layer, we run multiple small RNN cells rather than one single large cell. In this paper, we evaluate PCs on 2 tasks. On language modeling task on PTB (Penn Tree Bank), our model outperforms state of art models by decreasing perplexity from 78.6 to 75.3. On Chinese-English translation task, our model increases BLEU score for 0.39 points than baseline model.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Neural Networks and Applications
