Enhancing Retrieval Processes for Language Generation with Augmented Queries
Julien Pierre Edmond Ghali, Kosuke Shima, Koichi Moriyama, Atsuko, Mutoh, Nobuhiro Inuzuka

TL;DR
This paper improves document retrieval for language models by integrating retrieval-augmented generation with query optimization, demonstrating enhanced accuracy and efficiency across different models and scenarios.
Contribution
It introduces an innovative query optimization process combined with RAG, utilizing models like Orca2 7B and BERT with UMAP to enhance retrieval scalability and accuracy.
Findings
Significant performance improvement with RAG and prompt augmenters.
Effective document retrieval using language model-generated queries.
UMAP simplifies retrieval while maintaining strong results.
Abstract
In the rapidly changing world of smart technology, searching for documents has become more challenging due to the rise of advanced language models. These models sometimes face difficulties, like providing inaccurate information, commonly known as "hallucination." This research focuses on addressing this issue through Retrieval-Augmented Generation (RAG), a technique that guides models to give accurate responses based on real facts. To overcome scalability issues, the study explores connecting user queries with sophisticated language models such as BERT and Orca2, using an innovative query optimization process. The study unfolds in three scenarios: first, without RAG, second, without additional assistance, and finally, with extra help. Choosing the compact yet efficient Orca2 7B model demonstrates a smart use of computing resources. The empirical results indicate a significant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Dropout · Layer Normalization · Byte Pair Encoding · Attention Dropout · Multi-Head Attention · Linear Warmup With Linear Decay · Dense Connections · Adam
