Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA

Rishabh Maheshwary; Masoud Hashemi; Khyati Mahajan; Shiva Krishna Reddy Malay; Sai Rajeswar; Sathwik Tejaswi Madhusudhan; Spandana Gella; and Vikas Yadav

arXiv:2505.16293·cs.CL·May 23, 2025

Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA

Rishabh Maheshwary, Masoud Hashemi, Khyati Mahajan, Shiva Krishna Reddy Malay, Sai Rajeswar, Sathwik Tejaswi Madhusudhan, Spandana Gella, and Vikas Yadav

PDF

Open Access

TL;DR

This paper introduces Notes Writing, a method that enhances iterative retrieval-augmented generation by generating concise notes from retrieved documents, improving reasoning over large contexts in complex question answering tasks.

Contribution

It presents a scalable, model-agnostic notes writing approach that reduces irrelevant information and effectively increases context length for better reasoning in iterative RAG systems.

Findings

01

Average improvement of 15.6 percentage points across datasets

02

Effective integration with multiple RAG methods and models

03

Minimal increase in output tokens

Abstract

Iterative RAG for multi-hop question answering faces challenges with lengthy contexts and the buildup of irrelevant information. This hinders a model's capacity to process and reason over retrieved content and limits performance. While recent methods focus on compressing retrieved information, they are either restricted to single-round RAG, require finetuning or lack scalability in iterative RAG. To address these challenges, we propose Notes Writing, a method that generates concise and relevant notes from retrieved documents at each step, thereby reducing noise and retaining only essential information. This indirectly increases the effective context length of Large Language Models (LLMs), enabling them to reason and plan more effectively while processing larger volumes of input text. Notes Writing is framework agnostic and can be integrated with different iterative RAG methods. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems · Multimodal Machine Learning Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Attention Dropout · Softmax · WordPiece · Weight Decay · Dropout · Adam · Linear Layer