Multiple Abstraction Level Retrieve Augment Generation

Zheng Zheng (1); Xinyi Ni (1); Pengyu Hong (1) ((1) Brandeis; University)

arXiv:2501.16952·cs.CL·January 29, 2025

Multiple Abstraction Level Retrieve Augment Generation

Zheng Zheng (1), Xinyi Ni (1), Pengyu Hong (1) ((1) Brandeis, University)

PDF

Open Access

TL;DR

This paper introduces a multi-abstraction level retrieval-augmented generation method that enhances answer correctness in scientific question-answering by retrieving information at various abstraction levels, addressing limitations of single-level retrieval.

Contribution

It proposes a novel RAG approach using chunks of multiple abstraction levels, improving answer accuracy in scientific domains compared to traditional methods.

Findings

01

25.739% improvement in answer correctness on Glyco-related papers

02

Effective across multiple abstraction levels of information

03

Addresses the 'lost in the middle' problem in RAG systems

Abstract

A Retrieval-Augmented Generation (RAG) model powered by a large language model (LLM) provides a faster and more cost-effective solution for adapting to new data and knowledge. It also delivers more specialized responses compared to pre-trained LLMs. However, most existing approaches rely on retrieving prefix-sized chunks as references to support question-answering (Q/A). This approach is often deployed to address information needs at a single level of abstraction, as it struggles to generate answers across multiple levels of abstraction. In an RAG setting, while LLMs can summarize and answer questions effectively when provided with sufficient details, retrieving excessive information often leads to the 'lost in the middle' problem and exceeds token limitations. We propose a novel RAG approach that uses chunks of multiple abstraction levels (MAL), including multi-sentence-level,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Technology and Assessment · Image Retrieval and Classification Techniques · Neural Networks and Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Layer Normalization · Softmax · Linear Warmup With Linear Decay · Adam · Residual Connection · Dropout · Byte Pair Encoding · Linear Layer