SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented   Generation

Zijun Yao; Weijian Qi; Liangming Pan; Shulin Cao; Linmei; Hu; Weichuan Liu; Lei Hou; Juanzi Li

arXiv:2406.19215·cs.CL·June 28, 2024

SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation

Zijun Yao, Weijian Qi, Liangming Pan, Shulin Cao, Linmei, Hu, Weichuan Liu, Lei Hou, Juanzi Li

PDF

Open Access 1 Repo

TL;DR

SeaKR is an adaptive retrieval-augmented generation model that leverages LLMs' self-aware uncertainty to selectively retrieve, re-rank, and choose reasoning strategies, improving performance on question answering tasks.

Contribution

Introduces SeaKR, a novel adaptive RAG model that uses LLMs' self-aware uncertainty for retrieval, re-ranking, and reasoning strategy selection.

Findings

01

Outperforms existing adaptive RAG methods on QA datasets

02

Effectively utilizes self-aware uncertainty for retrieval and reasoning

03

Enhances complex task performance with multiple retrievals

Abstract

This paper introduces Self-aware Knowledge Retrieval (SeaKR), a novel adaptive RAG model that extracts self-aware uncertainty of LLMs from their internal states. SeaKR activates retrieval when the LLMs present high self-aware uncertainty for generation. To effectively integrate retrieved knowledge snippets, SeaKR re-ranks them based on LLM's self-aware uncertainty to preserve the snippet that reduces their uncertainty to the utmost. To facilitate solving complex tasks that require multiple retrievals, SeaKR utilizes their self-aware uncertainty to choose among different reasoning strategies. Our experiments on both complex and simple Question Answering datasets show that SeaKR outperforms existing adaptive RAG methods. We release our code at https://github.com/THU-KEG/SeaKR.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thu-keg/seakr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Intelligent Tutoring Systems and Adaptive Learning

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · WordPiece · Softmax · Layer Normalization · Linear Warmup With Linear Decay · Byte Pair Encoding · Attention Dropout · Dropout