QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering
Michihiro Yasunaga, Hongyu Ren, Antoine Bosselut, Percy Liang, Jure, Leskovec

TL;DR
QA-GNN introduces a novel approach combining language models and knowledge graphs through joint graph reasoning, significantly improving question answering performance and interpretability across multiple domains.
Contribution
It proposes a new model that uses relevance scoring and joint graph reasoning to enhance QA by integrating language models with knowledge graphs.
Findings
Outperforms existing LM and LM+KG models on benchmark datasets.
Demonstrates interpretable reasoning, including handling negation.
Effective across commonsense and biomedical QA tasks.
Abstract
The problem of answering questions using knowledge from pre-trained language models (LMs) and knowledge graphs (KGs) presents two challenges: given a QA context (question and answer choice), methods need to (i) identify relevant knowledge from large KGs, and (ii) perform joint reasoning over the QA context and KG. In this work, we propose a new model, QA-GNN, which addresses the above challenges through two key innovations: (i) relevance scoring, where we use LMs to estimate the importance of KG nodes relative to the given QA context, and (ii) joint reasoning, where we connect the QA context and KG to form a joint graph, and mutually update their representations through graph neural networks. We evaluate our model on QA benchmarks in the commonsense (CommonsenseQA, OpenBookQA) and biomedical (MedQA-USMLE) domains. QA-GNN outperforms existing LM and LM+KG models, and exhibits…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Label Smoothing · Graph Attention Network · Dropout · Byte Pair Encoding · Adam · Dense Connections
