There is No Big Brother or Small Brother: Knowledge Infusion in Language Models for Link Prediction and Question Answering
Ankush Agarwal, Sakharam Gawade, Sachin Channabasavarajendra, Pushpak, Bhattacharyya

TL;DR
This paper investigates how knowledge infusion affects language models' performance in link prediction and question answering across aviation, movie, and web domains, finding small models perform comparably to larger ones.
Contribution
It demonstrates that small language models with knowledge infusion can match larger models' performance in link prediction and QA tasks across multiple domains.
Findings
Small models achieve similar performance to large models with knowledge infusion.
Synthetic QA pairs enable effective evaluation of knowledge-infused models.
Substantial agreement between models confirmed by Cohen's kappa score.
Abstract
The integration of knowledge graphs with deep learning is thriving in improving the performance of various natural language processing (NLP) tasks. In this paper, we focus on knowledge-infused link prediction and question answering using language models, T5, and BLOOM across three domains: Aviation, Movie, and Web. In this context, we infuse knowledge in large and small language models and study their performance, and find the performance to be similar. For the link prediction task on the Aviation Knowledge Graph, we obtain a 0.2 hits@1 score using T5-small, T5-base, T5-large, and BLOOM. Using template-based scripts, we create a set of 1 million synthetic factoid QA pairs in the aviation domain from National Transportation Safety Board (NTSB) reports. On our curated QA pairs, the three models of T5 achieve a 0.7 hits@1 score. We validate out findings with the paired student t-test and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Data Quality and Management
MethodsAttention Is All You Need · BLOOM · Dense Connections · Layer Normalization · Attention Dropout · Byte Pair Encoding · Linear Layer · Dropout · Adafactor · Softmax
