Dynamic Uncertainty Ranking: Enhancing Retrieval-Augmented In-Context Learning for Long-Tail Knowledge in LLMs

Shuyang Yu; Runxue Bao; Parminder Bhatia; Taha Kass-Hout; Jiayu Zhou; Cao Xiao

arXiv:2410.23605·cs.CL·February 11, 2025

TL;DR

This paper introduces a reinforcement learning-based dynamic uncertainty ranking method for retrieval-augmented in-context learning, significantly improving long-tail knowledge retrieval and accuracy in large language models.

Contribution

It proposes a novel dynamic uncertainty ranking approach that prioritizes informative samples and adapts thresholds, enhancing long-tail knowledge retrieval in LLMs.

Findings

01 Outperforms baseline by 2.76% in overall accuracy.

02 Achieves 5.96% improvement on long-tail questions.

03 Reduces query costs with a learnable ranking threshold.

Abstract

Large language models (LLMs) can learn vast amounts of knowledge from diverse domains during pre-training. However, long-tail knowledge from specialized domains is often scarce and underrepresented, and the models rarely memorize it. Prior work has shown that in-context learning (ICL) with retriever augmentation can help LLMs better capture long-tail knowledge, reducing their reliance on pre-trained data. Despite these advances, we observe that LLM predictions for long-tail questions remain uncertain under variations in the retrieved samples. To take advantage of this uncertainty in ICL and guide LLM predictions toward correct answers on long-tail samples, we propose a reinforcement learning-based dynamic uncertainty ranking method for ICL that accounts for the varying impact of each retrieved sample on LLM predictions. Our approach prioritizes more informative and stable samples…
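The two ideas named above, predictions that change as retrieved samples vary and a ranking threshold that limits which samples are used, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how ranking scores are learned, so the scores and threshold below are hard-coded placeholders, and the function names `select_by_threshold` and `count_prediction_flips` are hypothetical.

```python
from typing import List, Sequence


def select_by_threshold(scores: Sequence[float], threshold: float, max_k: int) -> List[int]:
    """Return indices of candidates scoring above `threshold`,
    ordered from highest to lowest score and capped at `max_k`."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [i for i in ranked if scores[i] > threshold][:max_k]


def count_prediction_flips(predictions: Sequence[str]) -> int:
    """Count how often the answer changes between consecutive predictions,
    e.g. predictions obtained while adding retrieved samples one at a time."""
    return sum(1 for prev, cur in zip(predictions, predictions[1:]) if prev != cur)


# Placeholder ranking scores for four retrieved samples (assumed values).
scores = [0.91, 0.15, 0.62, 0.40]
selected = select_by_threshold(scores, threshold=0.5, max_k=3)

# Placeholder LLM answers after adding 1, 2, 3, and 4 samples (assumed values).
flips = count_prediction_flips(["A", "B", "B", "A"])
```

Here `selected` keeps only samples 0 and 2, so fewer samples need to be placed in the prompt, and `flips` counts two answer changes, a crude proxy for how unstable the prediction is with respect to the retrieved set.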

Taxonomy

Topics: Data Quality and Management · Machine Learning and Algorithms