Unmasking Hallucinations: A Causal Graph-Attention Perspective on Factual Reliability in Large Language Models
Sailesh kiran kurra, Shiek Ruksana, Vishal Borusu

TL;DR
This paper introduces a causal graph attention network framework that interprets internal attention flows in LLMs to reduce hallucinations, significantly improving factual accuracy and interpretability.
Contribution
It proposes a novel causal graph attention network with token-level graphs and a new metric, CCS, to mitigate hallucinations in LLMs.
Findings
27.8% reduction in hallucination rate on benchmarks
16.4% improvement in factual accuracy
Enhanced interpretability and robustness of LLMs
Abstract
This paper primarily focuses on the hallucinations caused due to AI language models(LLMs).LLMs have shown extraordinary Language understanding and generation capabilities .Still it has major a disadvantage hallucinations which give outputs which are factually incorrect ,misleading or unsupported by input data . These hallucinations cause serious problems in scenarios like medical diagnosis or legal reasoning.Through this work,we propose causal graph attention network (GCAN) framework that reduces hallucinations through interpretation of internal attention flow within a transformer architecture with the help of constructing token level graphs that combine self attention weights and gradient based influence scores.our method quantifies each tokens factual dependency using a new metric called the Causal Contribution Score (CCS). We further introduce a fact-anchored graph reweighting layer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
