Prompt Generate Train (PGT): Few-shot Domain Adaption of Retrieval Augmented Generation Models for Open Book Question-Answering
C. S. Krishna

TL;DR
This paper introduces PGT, a framework for few-shot domain adaptation of retrieval-augmented generation models to improve open-book question answering over proprietary texts, aiming for cost-effective and accurate answers.
Contribution
The paper presents a novel pipeline combining synthetic data generation, supervised fine-tuning, reinforcement learning, and uncertainty calibration for domain-specific RAG models.
Findings
Synthetic data generation improves domain relevance.
Reinforcement learning enhances answer grounding.
Model achieves competitive performance with lower costs.
Abstract
We propose a framework - Prompt, Generate, Train (PGT) - to efficiently develop a generative question-answering model for open-book question-answering over a proprietary collection of text documents. The framework adapts a retriever augmented generation (RAG) model to the target domain using supervised fine-tuning and reinforcement learning with synthetic feedback in a few-shot setting. This, we hypothesize, will yield an aligned, uncertainty calibrated model that is competitive with GPT-4 based in-context retrieval augmented generation in generating relevant answers at lower serving costs. The framework's synthetic generation pipeline will generate synthetic training data comprising <passage, question, answer> tuples using an open-source LLM and a novel consistency filtering scheme. The pipeline will be designed to generate both abstractive and extractive questions that span the entire…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Expert finding and Q&A systems
MethodsMulti-Head Attention · Attention Is All You Need · ALIGN · Absolute Position Encodings · Label Smoothing · Linear Layer · Attention Dropout · Softmax · Position-Wise Feed-Forward Layer · Byte Pair Encoding
