Adaptive Retrieval helps Reasoning in LLMs -- but mostly if it's not used
Srijan Shakya, Anamaria-Roberta Hartl, Sepp Hochreiter, Korbinian P\"oppel

TL;DR
This paper investigates adaptive retrieval in large language models, revealing that selectively choosing when to retrieve external knowledge can improve reasoning, especially when the model decides not to retrieve, highlighting a metacognitive aspect.
Contribution
It introduces an adaptive retrieval-augmented architecture where LLMs decide when to query external knowledge, showing that non-retrieval correlates with better reasoning performance.
Findings
Static retrieval underperforms compared to Chain-of-Thought.
Retrieval rarely aids reasoning; sometimes it hinders performance.
Models scale retrieval frequency with problem difficulty.
Abstract
Large Language Models (LLMs) often falter in complex reasoning tasks due to their static, parametric knowledge, leading to hallucinations and poor performance in specialized domains like mathematics. This work explores a fundamental principle for enhancing generative models: treating retrieval as a form of dynamic in-context learning. We test an adaptive retrieval-augmented architecture where an LLM agent actively decides when to query an external knowledge base during its reasoning process. We compare this adaptive strategy against a standard Chain-of-Thought (CoT) baseline and a static retrieval approach on the GSM8K and MATH-500 benchmarks. Although our experiments show that static retrieval is inferior to CoT, the adaptive retrieval shows interesting behavior: While traces including retrieved results show slightly worse performance compared to CoT, traces that do not include…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
