HAS-QA: Hierarchical Answer Spans Model for Open-domain Question Answering
Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, Lixin Su, Xueqi Cheng

TL;DR
This paper introduces HAS-QA, a hierarchical model for open-domain question answering that effectively handles multiple answer spans, irrelevant paragraphs, and answer position dependencies, outperforming existing models.
Contribution
The paper proposes a novel probabilistic hierarchical framework and model, HAS-QA, specifically designed to address unique challenges in open-domain QA tasks.
Findings
HAS-QA significantly outperforms traditional RC baselines.
The hierarchical model effectively manages multiple answer spans.
The approach improves accuracy on public OpenQA datasets.
Abstract
This paper is concerned with open-domain question answering (i.e., OpenQA). Recently, some works have viewed this problem as a reading comprehension (RC) task, and directly applied successful RC models to it. However, the performances of such models are not so good as that in the RC task. In our opinion, the perspective of RC ignores three characteristics in OpenQA task: 1) many paragraphs without the answer span are included in the data collection; 2) multiple answer spans may exist within one given paragraph; 3) the end position of an answer span is dependent with the start position. In this paper, we first propose a new probabilistic formulation of OpenQA, based on a three-level hierarchical structure, i.e.,~the question level, the paragraph level and the answer span level. Then a Hierarchical Answer Spans Model (HAS-QA) is designed to capture each probability. HAS-QA has the ability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Expert finding and Q&A systems
