TL;DR
DAVinCI is a dual attribution and verification framework that enhances the factual reliability and interpretability of large language models by attributing claims to sources and verifying them through entailment reasoning.
Contribution
Introduces DAVinCI, a novel two-stage framework combining attribution and verification to improve trustworthiness of LLM outputs, with extensive evaluation and open-source implementation.
Findings
DAVinCI improves classification accuracy and attribution metrics by 5-20%.
It effectively attributes claims to internal components and external sources.
The framework enhances the trustworthiness and accountability of LLMs.
Abstract
Large Language Models (LLMs) have demonstrated remarkable fluency and versatility across a wide range of NLP tasks, yet they remain prone to factual inaccuracies and hallucinations. This limitation poses significant risks in high-stakes domains such as healthcare, law, and scientific communication, where trust and verifiability are paramount. In this paper, we introduce DAVinCI - a Dual Attribution and Verification framework designed to enhance the factual reliability and interpretability of LLM outputs. DAVinCI operates in two stages: (i) it attributes generated claims to internal model components and external sources; (ii) it verifies each claim using entailment-based reasoning and confidence calibration. We evaluate DAVinCI across multiple datasets, including FEVER and CLIMATE-FEVER, and compare its performance against standard verification-only baselines. Our results show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
