DP-KB: Data Programming with Knowledge Bases Improves Transformer Fine Tuning for Answer Sentence Selection
Nic Jedema, Thuy Vu, Manish Gupta, and Alessandro Moschitti

TL;DR
This paper introduces DP-KB, a data programming method that leverages knowledge bases to enhance transformer fine-tuning for answer sentence selection, leading to improved performance on multiple benchmarks and real-world datasets.
Contribution
The paper presents a novel data programming technique that enriches training data with KB-derived context, boosting transformer performance without increasing inference costs.
Findings
Outperforms state-of-the-art on WikiQA and TrecQA benchmarks.
Achieves significant improvements in F1, MAP, and MRR scores.
Maintains performance even without KB context at inference time.
Abstract
While transformers demonstrate impressive performance on many knowledge intensive (KI) tasks, their ability to serve as implicit knowledge bases (KBs) remains limited, as shown on several slot-filling, question-answering (QA), fact verification, and entity-linking tasks. In this paper, we implement an efficient, data-programming technique that enriches training data with KB-derived context and improves transformer utilization of encoded knowledge when fine-tuning for a particular QA task, namely answer sentence selection (AS2). Our method outperforms state of the art transformer approach on WikiQA and TrecQA, two widely studied AS2 benchmarks, increasing by 2.0% p@1, 1.3% MAP, 1.1% MRR, and 4.4% p@1, 0.9% MAP, 2.4% MRR, respectively. To demonstrate our improvements in an industry setting, we additionally evaluate our approach on a proprietary dataset of Alexa QA pairs, and show increase…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
