HalluGraph: Auditable Hallucination Detection for Legal RAG Systems via Knowledge Graph Alignment
Valentin No\"el, Elimane Yassine Seidou, Charly Ken Capo-Chichi, Ghanem Amari

TL;DR
HalluGraph is a graph-based framework that detects hallucinations in legal AI systems by aligning knowledge graphs from context, query, and response, providing interpretable metrics for accountability.
Contribution
It introduces a novel graph-theoretic approach for verifiable hallucination detection in legal RAG systems, improving transparency and reliability over existing semantic similarity methods.
Findings
Achieves near-perfect discrimination on structured control documents.
Attains AUC = 0.979 on legal hallucination detection.
Maintains robust performance (AUC ≈ 0.89) on challenging legal tasks.
Abstract
Legal AI systems powered by retrieval-augmented generation (RAG) face a critical accountability challenge: when an AI assistant cites case law, statutes, or contractual clauses, practitioners need verifiable guarantees that generated text faithfully represents source documents. Existing hallucination detectors rely on semantic similarity metrics that tolerate entity substitutions, a dangerous failure mode when confusing parties, dates, or legal provisions can have material consequences. We introduce HalluGraph, a graph-theoretic framework that quantifies hallucinations through structural alignment between knowledge graphs extracted from context, query, and response. Our approach produces bounded, interpretable metrics decomposed into \textit{Entity Grounding} (EG), measuring whether entities in the response appear in source documents, and \textit{Relation Preservation} (RP), verifying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Ethics and Social Impacts of AI
