ERATTA: Extreme RAG for Table To Answers with Large Language Models
Sohini Roychowdhury, Marko Krema, Anvar Mahammad, Brian Moore, Arijit, Mukherjee, Punit Prakashchandra

TL;DR
The paper introduces ERATTA, an advanced retrieval-augmented generation system utilizing multiple large language models for fast, reliable, and structured question-answering from large, fluctuating enterprise data tables, with hallucination detection.
Contribution
It presents a novel multi-LLM framework with a five-metric hallucination detection system tailored for enterprise data table question-answering.
Findings
Achieves structured responses in under 10 seconds per query.
Reports over 90% confidence scores in multiple domains.
Detects and reports hallucinations effectively.
Abstract
Large language models (LLMs) with retrieval augmented-generation (RAG) have been the optimal choice for scalable generative AI solutions in the recent past. Although RAG implemented with AI agents (agentic-RAG) has been recently popularized, its suffers from unstable cost and unreliable performances for Enterprise-level data-practices. Most existing use-cases that incorporate RAG with LLMs have been either generic or extremely domain specific, thereby questioning the scalability and generalizability of RAG-LLM approaches. In this work, we propose a unique LLM-based system where multiple LLMs can be invoked to enable data authentication, user-query routing, data-retrieval and custom prompting for question-answering capabilities from Enterprise-data tables. The source tables here are highly fluctuating and large in size and the proposed framework enables structured responses in under 10…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Data Quality and Management · Natural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Weight Decay · Multi-Head Attention · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Dropout · Residual Connection · Softmax · WordPiece
