K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering
Fu Sun, Feng-Lin Li, Ruize Wang, Qianglong Chen, Xingyi Cheng, Ji, Zhang

TL;DR
K-AID is a systematic method that enhances pre-trained language models with domain relational knowledge, improving question answering and text classification tasks, while enabling deployment on resource-limited devices.
Contribution
The paper introduces K-AID, a novel approach capturing relational knowledge for domain-specific language models, with a focus on practical deployment and improved QA performance.
Findings
Significant improvements on sentence-level QA tasks.
Effective knowledge infusion and model size reduction.
Beneficial business impact demonstrated in E-commerce.
Abstract
Knowledge enhanced pre-trained language models (K-PLMs) are shown to be effective for many public tasks in the literature but few of them have been successfully applied in practice. To address this problem, we propose K-AID, a systematic approach that includes a low-cost knowledge acquisition process for acquiring domain knowledge, an effective knowledge infusion module for improving model performance, and a knowledge distillation component for reducing the model size and deploying K-PLMs on resource-restricted devices (e.g., CPU) for real-world application. Importantly, instead of capturing entity knowledge like the majority of existing K-PLMs, our approach captures relational knowledge, which contributes to better-improving sentence-level text classification and text matching tasks that play a key role in question answering (QA). We conducted a set of experiments on five text…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsKnowledge Distillation
