On Multiplicative Integration with Recurrent Neural Networks

Yuhuai Wu; Saizheng Zhang; Ying Zhang; Yoshua Bengio; Ruslan Salakhutdinov

arXiv:1606.06630 · cs.LG · November 15, 2016 · 60 cites

TL;DR

This paper proposes Multiplicative Integration (MI), a simple structural modification for RNNs that enhances information flow and improves performance across various models and tasks with minimal additional parameters.

Contribution

It introduces MI as a versatile, easy-to-implement structural change that can be integrated into existing RNN architectures like LSTMs and GRUs.

Findings

01. MI significantly improves RNN performance on multiple tasks.

02. MI can be embedded into various RNN models with minimal parameter increase.

03. Empirical analysis shows better learning behavior with MI.

Abstract

We introduce a general and simple structural design called Multiplicative Integration (MI) to improve recurrent neural networks (RNNs). MI changes the way in which information from different sources flows and is integrated in the computational building block of an RNN, while introducing almost no extra parameters. The new structure can be easily embedded into many popular RNN models, including LSTMs and GRUs. We empirically analyze its learning behaviour and conduct evaluations on several tasks using different RNN models. Our experimental results demonstrate that Multiplicative Integration can provide a substantial performance boost over many of the existing RNN models.
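
The abstract does not spell out the building block, so the following is only a minimal sketch of the idea. It assumes a vanilla RNN with a tanh activation: the usual additive step computes tanh(Wx + Uh + b), the basic MI step replaces the sum with an element-wise product, tanh(Wx ⊙ Uh + b), and the general MI form adds three gating vectors, tanh(α ⊙ Wx ⊙ Uh + β1 ⊙ Uh + β2 ⊙ Wx + b). The function names, the pure-Python matrix helper, and the choice of tanh are illustrative assumptions, not the authors' code.

```python
import math


def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def additive_step(W, U, b, x, h):
    """Standard additive building block: tanh(Wx + Uh + b)."""
    wx, uh = matvec(W, x), matvec(U, h)
    return [math.tanh(p + q + bias) for p, q, bias in zip(wx, uh, b)]


def mi_step(W, U, b, x, h, alpha=None, beta1=None, beta2=None):
    """Multiplicative Integration building block (sketch).

    Computes tanh(alpha * Wx * Uh + beta1 * Uh + beta2 * Wx + b),
    with all products taken element-wise. The defaults (alpha = 1,
    beta1 = beta2 = 0) reduce to the basic form tanh(Wx * Uh + b).
    """
    n = len(b)
    alpha = [1.0] * n if alpha is None else alpha
    beta1 = [0.0] * n if beta1 is None else beta1
    beta2 = [0.0] * n if beta2 is None else beta2
    wx, uh = matvec(W, x), matvec(U, h)
    return [
        math.tanh(a * p * q + b1 * q + b2 * p + bias)
        for a, b1, b2, p, q, bias in zip(alpha, beta1, beta2, wx, uh, b)
    ]


def mi_extra_parameters(hidden_size):
    """Extra parameters of the general form: the vectors alpha, beta1, beta2."""
    return 3 * hidden_size


if __name__ == "__main__":
    # Hypothetical toy weights, chosen only to exercise the functions.
    W = [[0.5, -0.2], [0.1, 0.3]]
    U = [[0.4, 0.0], [-0.3, 0.2]]
    b = [0.0, 0.1]
    x, h = [1.0, 2.0], [0.5, -0.5]
    print("additive:", additive_step(W, U, b, x, h))
    print("MI      :", mi_step(W, U, b, x, h))
```

Under these assumptions the additive block is a special case of the general form (alpha = 0, beta1 = beta2 = 1), and the only new parameters are three vectors of the hidden size, which is consistent with the abstract's claim of almost no extra parameters.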

Taxonomy

Topics: Neural Networks and Applications · Music and Audio Processing · Topic Modeling