RAG based Question-Answering for Contextual Response Prediction System
Sriram Veturi, Saurabh Vaichal, Reshma Lal Jagadheesh, Nafis Irtiza, Tripto, Nian Yan

TL;DR
This paper presents an end-to-end RAG-based question-answering system tailored for customer service, demonstrating improved accuracy and relevance over existing methods through automated and human evaluations.
Contribution
It introduces a practical RAG framework for industry use, addressing data, evaluation, and cost challenges in deploying LLMs for customer support.
Findings
Outperforms BERT-based algorithms in accuracy and relevance
Automated and human evaluations validate effectiveness
Supports customer service agents by reducing workload
Abstract
Large Language Models (LLMs) have shown versatility in various Natural Language Processing (NLP) tasks, including their potential as effective question-answering systems. However, to provide precise and relevant information in response to specific customer queries in industry settings, LLMs require access to a comprehensive knowledge base to avoid hallucinations. Retrieval Augmented Generation (RAG) emerges as a promising technique to address this challenge. Yet, developing an accurate question-answering framework for real-world applications using RAG entails several challenges: 1) data availability issues, 2) evaluating the quality of generated content, and 3) the costly nature of human evaluation. In this paper, we introduce an end-to-end framework that employs LLMs with RAG capabilities for industry use cases. Given a customer query, the proposed system retrieves relevant knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSeismology and Earthquake Studies · Text and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · travel james · Attention Is All You Need · Byte Pair Encoding · Softmax · Dropout · Layer Normalization · Linear Layer · Adam · Weight Decay
