K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for   Question Answering

Fu Sun; Feng-Lin Li; Ruize Wang; Qianglong Chen; Xingyi Cheng; Ji; Zhang

arXiv:2109.10547·cs.AI·September 23, 2021

K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering

Fu Sun, Feng-Lin Li, Ruize Wang, Qianglong Chen, Xingyi Cheng, Ji, Zhang

PDF

Open Access

TL;DR

K-AID is a systematic method that enhances pre-trained language models with domain relational knowledge, improving question answering and text classification tasks, while enabling deployment on resource-limited devices.

Contribution

The paper introduces K-AID, a novel approach capturing relational knowledge for domain-specific language models, with a focus on practical deployment and improved QA performance.

Findings

01

Significant improvements on sentence-level QA tasks.

02

Effective knowledge infusion and model size reduction.

03

Beneficial business impact demonstrated in E-commerce.

Abstract

Knowledge enhanced pre-trained language models (K-PLMs) are shown to be effective for many public tasks in the literature but few of them have been successfully applied in practice. To address this problem, we propose K-AID, a systematic approach that includes a low-cost knowledge acquisition process for acquiring domain knowledge, an effective knowledge infusion module for improving model performance, and a knowledge distillation component for reducing the model size and deploying K-PLMs on resource-restricted devices (e.g., CPU) for real-world application. Importantly, instead of capturing entity knowledge like the majority of existing K-PLMs, our approach captures relational knowledge, which contributes to better-improving sentence-level text classification and text matching tasks that play a key role in question answering (QA). We conducted a set of experiments on five text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsKnowledge Distillation