Pay More Attention - Neural Architectures for Question-Answering
Zia Hasan, Sebastian Fischer

TL;DR
This paper explores advanced attention mechanisms for machine comprehension, proposing a hybrid model and a new simplified attention method that improve performance on the SQuAD dataset.
Contribution
It introduces a novel Double Cross Attention mechanism and a hybrid scheme combining BiDAF and DCN, enhancing question-answering accuracy.
Findings
The hybrid model outperforms individual attention mechanisms.
Double Cross Attention achieves superior results with simpler architecture.
Both proposed models surpass previous state-of-the-art on SQuAD.
Abstract
Machine comprehension is a representative task of natural language understanding. Typically, we are given context paragraph and the objective is to answer a question that depends on the context. Such a problem requires to model the complex interactions between the context paragraph and the question. Lately, attention mechanisms have been found to be quite successful at these tasks and in particular, attention mechanisms with attention flow from both context-to-question and question-to-context have been proven to be quite useful. In this paper, we study two state-of-the-art attention mechanisms called Bi-Directional Attention Flow (BiDAF) and Dynamic Co-Attention Network (DCN) and propose a hybrid scheme combining these two architectures that gives better overall performance. Moreover, we also suggest a new simpler attention mechanism that we call Double Cross Attention (DCA) that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
