RASPRef: Retrieval-Augmented Self-Supervised Prompt Refinement for Large Reasoning Models
Rahul Soni

TL;DR
RASPRef is a retrieval-augmented, self-supervised framework that iteratively refines prompts for large reasoning models, improving performance without human supervision by leveraging retrieval and feedback signals.
Contribution
It introduces a novel retrieval-guided prompt refinement method that directly optimizes prompts through self-supervised signals, reducing manual prompt engineering.
Findings
Retrieval-guided prompting improves performance over static prompts.
Self-supervised signals like multi-sample consistency and verifier feedback enhance prompt refinement.
Prompt quality significantly impacts reasoning model performance.
Abstract
Recent reasoning-focused language models such as DeepSeek R1 and OpenAI o1 have demonstrated strong performance on structured reasoning benchmarks including GSM8K, MATH, and multi-hop question answering tasks. However, their performance remains highly sensitive to prompt formulation, and designing effective prompts is typically a manual and iterative process that does not scale well across tasks or domains. To address this limitation, we introduce Retrieval-Augmented Self-Supervised Prompt Refinement (RASPRef), a framework that improves prompts without requiring human annotations or task-specific supervision. The approach retrieves relevant examples and previously generated reasoning trajectories, and leverages signals such as multi-sample consistency, verifier feedback, and model-generated critiques to iteratively refine the prompt. Unlike prior approaches that focus primarily on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
