When to Trust: A Causality-Aware Calibration Framework for Accurate Knowledge Graph Retrieval-Augmented Generation

Jing Ren; Bowen Li; Ziqi Xu; Xikun Zhang; Haytham Fayek; Xiaodong Li

arXiv:2601.09241·cs.CL·February 4, 2026

When to Trust: A Causality-Aware Calibration Framework for Accurate Knowledge Graph Retrieval-Augmented Generation

Jing Ren, Bowen Li, Ziqi Xu, Xikun Zhang, Haytham Fayek, Xiaodong Li

PDF

Open Access

TL;DR

This paper introduces Ca2KG, a causality-aware calibration framework for KG-RAG that improves the reliability and calibration of knowledge graph-based language models, especially in high-stakes scenarios.

Contribution

It presents a novel calibration method combining counterfactual prompting and panel-based re-scoring to address overconfidence in KG-RAG models.

Findings

01

Ca2KG improves calibration across datasets.

02

Maintains or enhances predictive accuracy.

03

Reduces overconfidence in uncertain retrievals.

Abstract

Knowledge Graph Retrieval-Augmented Generation (KG-RAG) extends the RAG paradigm by incorporating structured knowledge from knowledge graphs, enabling Large Language Models (LLMs) to perform more precise and explainable reasoning. While KG-RAG improves factual accuracy in complex tasks, existing KG-RAG models are often severely overconfident, producing high-confidence predictions even when retrieved sub-graphs are incomplete or unreliable, which raises concerns for deployment in high-stakes domains. To address this issue, we propose Ca2KG, a Causality-aware Calibration framework for KG-RAG. Ca2KG integrates counterfactual prompting, which exposes retrieval-dependent uncertainties in knowledge quality and reasoning reliability, with a panel-based re-scoring mechanism that stabilises predictions across interventions. Extensive experiments on two complex QA datasets demonstrate that Ca2KG…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Machine Learning in Healthcare