Neural Associative Memory for Dual-Sequence Modeling

Dirk Weissenborn

arXiv:1606.03864·cs.NE·June 15, 2016

Neural Associative Memory for Dual-Sequence Modeling

Dirk Weissenborn

PDF

1 Repo

TL;DR

This paper introduces a novel neural architecture called Dual AM-RNN, based on associative memory, designed for dual-sequence NLP tasks, showing promising results in textual entailment and long-range dependency modeling.

Contribution

It proposes a new dual-sequence neural model using associative memory, extending RNNs to better capture dependencies in sequence-to-sequence tasks.

Findings

01

Achieves competitive results on textual entailment.

02

Effectively bridges long-range dependencies.

03

Needs additional supervision for sequence-to-sequence tasks.

Abstract

Many important NLP problems can be posed as dual-sequence or sequence-to-sequence modeling tasks. Recent advances in building end-to-end neural architectures have been highly successful in solving such tasks. In this work we propose a new architecture for dual-sequence modeling that is based on associative memory. We derive AM-RNNs, a recurrent associative memory (AM) which augments generic recurrent neural networks (RNN). This architecture is extended to the Dual AM-RNN which operates on two AMs at once. Our models achieve very competitive results on textual entailment. A qualitative analysis demonstrates that long range dependencies between source and target-sequence can be bridged effectively using Dual AM-RNNs. However, an initial experiment on auto-encoding reveals that these benefits are not exploited by the system when learning to solve sequence-to-sequence tasks which indicates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dirkweissenborn/dual_am_rnn
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.