Going Wider: Recurrent Neural Network With Parallel Cells

Danhao Zhu; Si Shen; Xin-Yu Dai; Jiajun Chen

arXiv:1705.01346·cs.CL·May 4, 2017·5 cites

Going Wider: Recurrent Neural Network With Parallel Cells

Danhao Zhu, Si Shen, Xin-Yu Dai, Jiajun Chen

PDF

Open Access

TL;DR

This paper introduces parallel cells in RNNs, running multiple small cells per layer to improve sequence modeling, demonstrating enhanced performance on language modeling and translation tasks.

Contribution

It proposes a novel parallel cells technique for RNNs, which improves learning ability and achieves better results than existing models.

Findings

01

Reduced perplexity on PTB from 78.6 to 75.3

02

Increased BLEU score by 0.39 points in translation

03

Enhanced RNN performance on sequence tasks

Abstract

Recurrent Neural Network (RNN) has been widely applied for sequence modeling. In RNN, the hidden states at current step are full connected to those at previous step, thus the influence from less related features at previous step may potentially decrease model's learning ability. We propose a simple technique called parallel cells (PCs) to enhance the learning ability of Recurrent Neural Network (RNN). In each layer, we run multiple small RNN cells rather than one single large cell. In this paper, we evaluate PCs on 2 tasks. On language modeling task on PTB (Penn Tree Bank), our model outperforms state of art models by decreasing perplexity from 78.6 to 75.3. On Chinese-English translation task, our model increases BLEU score for 0.39 points than baseline model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Neural Networks and Applications