DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering
Tong Wang, Chi Jin, Yongkang Chen, Huan Deng, Xiaohui Kuang, Gang Zhao

TL;DR
DataFactory introduces a multi-agent framework that enhances table question answering by improving reasoning, reducing hallucinations, and enabling flexible collaboration among specialized agents, leading to significant accuracy gains.
Contribution
The paper presents a novel multi-agent system with automated knowledge transformation and adaptive planning to address LLM limitations in TableQA.
Findings
20.2% accuracy improvement on TabFact
23.9% accuracy improvement on WikiTQ
Team coordination outperforms single-agent systems
Abstract
Table Question Answering (TableQA) enables natural language interaction with structured tabular data. However, existing large language model (LLM) approaches face critical limitations: context length constraints that restrict data handling capabilities, hallucination issues that compromise answer reliability, and single-agent architectures that struggle with complex reasoning scenarios involving semantic relationships and multi-hop logic. This paper introduces DataFactory, a multi-agent framework that addresses these limitations through specialized team coordination and automated knowledge transformation. The framework comprises a Data Leader employing the ReAct paradigm for reasoning orchestration, together with dedicated Database and Knowledge Graph teams, enabling the systematic decomposition of complex queries into structured and relational reasoning tasks. We formalize automated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Advanced Graph Neural Networks
