Thucy: An LLM-based Multi-Agent System for Claim Verification across Relational Databases
Michael Theologitis, Dan Suciu

TL;DR
Thucy is a novel multi-agent system leveraging LLMs to verify claims across multiple relational databases, providing transparent evidence and surpassing previous benchmarks in accuracy.
Contribution
It introduces the first cross-database, multi-table claim verification system that is data-source agnostic and offers explicit SQL-based evidence for its verdicts.
Findings
Achieved 94.3% accuracy on TabFact, surpassing previous state-of-the-art by 5.6%.
Provides transparent SQL queries supporting each verification decision.
Operates effectively across multiple relational databases without prior data source knowledge.
Abstract
In today's age, it is becoming increasingly difficult to decipher truth from lies. Every day, politicians, media outlets, and public figures make conflicting claims -- often about topics that can, in principle, be verified against structured data. For instance, statements about crime rates, economic growth or healthcare can all be verified against official public records and structured datasets. Building a system that can automatically do that would have sounded like science fiction just a few years ago. Yet, with the extraordinary progress in LLMs and agentic AI, this is now within reach. Still, there remains a striking gap between what is technically possible and what is being demonstrated by recent work. Most existing verification systems operate only on small, single-table databases -- typically a few hundred rows -- that conveniently fit within an LLM's context window. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Quality and Management · Scientific Computing and Data Management · Big Data and Digital Economy
