MEMEN: Multi-layer Embedding with Memory Networks for Machine Comprehension
Boyuan Pan, Hao Li, Zhou Zhao, Bin Cao, Deng Cai, Xiaofei He

TL;DR
MEMEN introduces a multi-layer embedding with memory networks that enhances encoding of syntactic and semantic information, leading to state-of-the-art performance in machine comprehension tasks like SQuAD and TriviaQA.
Contribution
The paper proposes a novel neural network architecture combining multi-layer embeddings trained with skip-gram and a full-orientation matching memory network for improved machine reading comprehension.
Findings
Achieves state-of-the-art results on TriviaQA dataset.
Demonstrates competitive performance on SQuAD.
Improves encoding of syntactic and semantic information.
Abstract
Machine comprehension(MC) style question answering is a representative problem in natural language processing. Previous methods rarely spend time on the improvement of encoding layer, especially the embedding of syntactic information and name entity of the words, which are very crucial to the quality of encoding. Moreover, existing attention methods represent each query word as a vector or use a single vector to represent the whole query sentence, neither of them can handle the proper weight of the key words in query sentence. In this paper, we introduce a novel neural network architecture called Multi-layer Embedding with Memory Network(MEMEN) for machine reading task. In the encoding layer, we employ classic skip-gram model to the syntactic and semantic information of the words to train a new kind of embedding layer. We also propose a memory network of full-orientation matching of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsMemory Network
