Adaptive Recurrent Neural Network Based on Mixture Layer

Kui Zhao; Yuechuan Li; Chi Zhang; Cheng Yang; Huan Xu

arXiv:1801.08094·cs.LG·February 28, 2019

Adaptive Recurrent Neural Network Based on Mixture Layer

Kui Zhao, Yuechuan Li, Chi Zhang, Cheng Yang, Huan Xu

PDF

Open Access

TL;DR

This paper introduces an adaptive RNN with a mixture layer that clusters sequence patterns into prototypes, enhancing its ability to model sequences with multiple patterns, and demonstrates its effectiveness through experiments.

Contribution

The paper proposes a novel mixture layer for RNNs that adaptively updates states based on pattern prototypes, improving multi-pattern sequence modeling.

Findings

01

Effective in modeling sequences with multiple patterns

02

Improves RNN performance on synthetic and real datasets

03

Can incorporate prior knowledge for further enhancement

Abstract

Although Recurrent Neural Network (RNN) has been a powerful tool for modeling sequential data, its performance is inadequate when processing sequences with multiple patterns. In this paper, we address this challenge by introducing a novel mixture layer and constructing an adaptive RNN. The mixture layer augmented RNN (termed as M-RNN) partitions patterns in training sequences into several clusters and stores the principle patterns as prototype vectors of components in a mixture model. By leveraging the mixture layer, the proposed method can adaptively update states according to the similarities between encoded inputs and prototype vectors, leading to a stronger capacity in assimilating sequences with multiple patterns. Moreover, our approach can be further extended by taking advantage of prior knowledge about data. Experiments on both synthetic and real datasets demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Music and Audio Processing · Text and Document Classification Technologies