Densely Connected Attention Propagation for Reading Comprehension
Yi Tay, Luu Anh Tuan, Siu Cheung Hui, Jian Su

TL;DR
This paper introduces DecaProp, a densely connected neural network with attention-based links for reading comprehension, achieving state-of-the-art results across multiple benchmarks.
Contribution
DecaProp uniquely combines dense layer connections with attention-based links, enhancing passage-query relationship modeling in reading comprehension.
Findings
Achieves state-of-the-art results on four RC benchmarks.
Outperforms existing models by up to 14.2% in F1 score.
Demonstrates effectiveness of attention-based dense connections.
Abstract
We propose DecaProp (Densely Connected Attention Propagation), a new densely connected neural architecture for reading comprehension (RC). There are two distinct characteristics of our model. Firstly, our model densely connects all pairwise layers of the network, modeling relationships between passage and query across all hierarchical levels. Secondly, the dense connectors in our network are learned via attention instead of standard residual skip-connectors. To this end, we propose novel Bidirectional Attention Connectors (BAC) for efficiently forging connections throughout the network. We conduct extensive experiments on four challenging RC benchmarks. Our proposed approach achieves state-of-the-art results on all four, outperforming existing baselines by up to in absolute F1 score.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text Readability and Simplification · Multimodal Machine Learning Applications
