RASPRef: Retrieval-Augmented Self-Supervised Prompt Refinement for Large Reasoning Models

Rahul Soni

arXiv:2603.27008·cs.CL·March 31, 2026

RASPRef: Retrieval-Augmented Self-Supervised Prompt Refinement for Large Reasoning Models

Rahul Soni

PDF

TL;DR

RASPRef is a retrieval-augmented, self-supervised framework that iteratively refines prompts for large reasoning models, improving performance without human supervision by leveraging retrieval and feedback signals.

Contribution

It introduces a novel retrieval-guided prompt refinement method that directly optimizes prompts through self-supervised signals, reducing manual prompt engineering.

Findings

01

Retrieval-guided prompting improves performance over static prompts.

02

Self-supervised signals like multi-sample consistency and verifier feedback enhance prompt refinement.

03

Prompt quality significantly impacts reasoning model performance.

Abstract

Recent reasoning-focused language models such as DeepSeek R1 and OpenAI o1 have demonstrated strong performance on structured reasoning benchmarks including GSM8K, MATH, and multi-hop question answering tasks. However, their performance remains highly sensitive to prompt formulation, and designing effective prompts is typically a manual and iterative process that does not scale well across tasks or domains. To address this limitation, we introduce Retrieval-Augmented Self-Supervised Prompt Refinement (RASPRef), a framework that improves prompts without requiring human annotations or task-specific supervision. The approach retrieves relevant examples and previously generated reasoning trajectories, and leverages signals such as multi-sample consistency, verifier feedback, and model-generated critiques to iteratively refine the prompt. Unlike prior approaches that focus primarily on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.