Dynamic Retrieval-Augmented Generation
Anton Shapkin, Denis Litvinov, Yaroslav Zharov, Egor Bogomolov, Timur, Galimzyanov, Timofey Bryksin

TL;DR
The paper introduces DRAG, a novel retrieval-augmented generation method that injects compressed entity embeddings to enhance context length and accuracy, especially in code generation, outperforming existing baselines.
Contribution
It proposes a new dynamic retrieval-augmented generation approach that overcomes context window limitations by embedding retrieved entities, with a new dataset for code generation tasks.
Findings
Achieves longer context handling and larger retrieval sets.
Outperforms all baselines except GPT-3.5.
Reduces issues with entity misspelling and retrieval failures.
Abstract
Current state-of-the-art large language models are effective in generating high-quality text and encapsulating a broad spectrum of world knowledge. These models, however, often hallucinate and lack locally relevant factual data. Retrieval-augmented approaches were introduced to overcome these problems and provide more accurate responses. Typically, the retrieved information is simply appended to the main request, restricting the context window size of the model. We propose a novel approach for the Dynamic Retrieval-Augmented Generation (DRAG), based on the entity-augmented generation, which injects compressed embeddings of the retrieved entities into the generative model. The proposed pipeline was developed for code-generation tasks, yet can be transferred to some domains of natural language processing. To train the model, we collect and publish a new project-level code generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Attention Dropout · Dense Connections · Cosine Annealing · {Dispute@FaQ-s}How to file a dispute with Expedia? · Adam · 15 Ways to Contact How can i speak to someone at Delta Airlines
