FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering
Akhil Kedia, Mohd Abbas Zaidi, Haejun Lee

TL;DR
This paper introduces FiE, a transformer encoder extension that fuses information across multiple passages to improve open-domain question answering, achieving state-of-the-art results with fewer parameters and lower latency.
Contribution
The paper proposes a novel encoder fusion method and an alternative answer scoring approach, significantly enhancing efficiency and accuracy in open-domain QA models.
Findings
Outperforms state-of-the-art by 2.5 EM on Natural Questions
Uses only 25% of parameters and 35% of inference latency
Achieves 4.4 EM on WebQuestions with improved data augmentation
Abstract
Generative models have recently started to outperform extractive models in Open Domain Question Answering, largely by leveraging their decoder to attend over multiple encoded passages and combining their information. However, generative models tend to be larger than extractive models due to the need for a decoder, run slower during inference due to auto-regressive decoder beam search, and their generated output often suffers from hallucinations. We propose to extend transformer encoders with the ability to fuse information from multiple passages, using global representation to provide cross-sample attention over all tokens across samples. Furthermore, we propose an alternative answer span probability calculation to better aggregate answer scores in the global space of all samples. Using our proposed method, we outperform the current state-of-the-art method by Exact Match score on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Expert finding and Q&A systems
