ERATTA: Extreme RAG for Table To Answers with Large Language Models

Sohini Roychowdhury; Marko Krema; Anvar Mahammad; Brian Moore; Arijit; Mukherjee; Punit Prakashchandra

arXiv:2405.03963·cs.AI·November 19, 2024

ERATTA: Extreme RAG for Table To Answers with Large Language Models

Sohini Roychowdhury, Marko Krema, Anvar Mahammad, Brian Moore, Arijit, Mukherjee, Punit Prakashchandra

PDF

Open Access

TL;DR

The paper introduces ERATTA, an advanced retrieval-augmented generation system utilizing multiple large language models for fast, reliable, and structured question-answering from large, fluctuating enterprise data tables, with hallucination detection.

Contribution

It presents a novel multi-LLM framework with a five-metric hallucination detection system tailored for enterprise data table question-answering.

Findings

01

Achieves structured responses in under 10 seconds per query.

02

Reports over 90% confidence scores in multiple domains.

03

Detects and reports hallucinations effectively.

Abstract

Large language models (LLMs) with retrieval augmented-generation (RAG) have been the optimal choice for scalable generative AI solutions in the recent past. Although RAG implemented with AI agents (agentic-RAG) has been recently popularized, its suffers from unstable cost and unreliable performances for Enterprise-level data-practices. Most existing use-cases that incorporate RAG with LLMs have been either generic or extremely domain specific, thereby questioning the scalability and generalizability of RAG-LLM approaches. In this work, we propose a unique LLM-based system where multiple LLMs can be invoked to enable data authentication, user-query routing, data-retrieval and custom prompting for question-answering capabilities from Enterprise-data tables. The source tables here are highly fluctuating and large in size and the proposed framework enables structured responses in under 10…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management · Natural Language Processing Techniques

MethodsAttention Is All You Need · Linear Layer · Weight Decay · Multi-Head Attention · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Dropout · Residual Connection · Softmax · WordPiece