Multiple Abstraction Level Retrieve Augment Generation
Zheng Zheng (1), Xinyi Ni (1), Pengyu Hong (1) ((1) Brandeis, University)

TL;DR
This paper introduces a multi-abstraction level retrieval-augmented generation method that enhances answer correctness in scientific question-answering by retrieving information at various abstraction levels, addressing limitations of single-level retrieval.
Contribution
It proposes a novel RAG approach using chunks of multiple abstraction levels, improving answer accuracy in scientific domains compared to traditional methods.
Findings
25.739% improvement in answer correctness on Glyco-related papers
Effective across multiple abstraction levels of information
Addresses the 'lost in the middle' problem in RAG systems
Abstract
A Retrieval-Augmented Generation (RAG) model powered by a large language model (LLM) provides a faster and more cost-effective solution for adapting to new data and knowledge. It also delivers more specialized responses compared to pre-trained LLMs. However, most existing approaches rely on retrieving prefix-sized chunks as references to support question-answering (Q/A). This approach is often deployed to address information needs at a single level of abstraction, as it struggles to generate answers across multiple levels of abstraction. In an RAG setting, while LLMs can summarize and answer questions effectively when provided with sufficient details, retrieving excessive information often leads to the 'lost in the middle' problem and exceeds token limitations. We propose a novel RAG approach that uses chunks of multiple abstraction levels (MAL), including multi-sentence-level,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Assessment · Image Retrieval and Classification Techniques · Neural Networks and Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Layer Normalization · Softmax · Linear Warmup With Linear Decay · Adam · Residual Connection · Dropout · Byte Pair Encoding · Linear Layer
