RaDeR: Reasoning-aware Dense Retrieval Models
Debrup Das, Sam O' Nuallain, Razieh Rahimi

TL;DR
RaDeR introduces reasoning-aware dense retrieval models trained on data from mathematical problem solving, significantly improving performance on reasoning tasks and outperforming traditional methods like BM25.
Contribution
RaDeR is the first dense retriever to outperform BM25 on reasoning steps, leveraging reasoning trajectories and self-reflective relevance evaluation for training.
Findings
RaDeR outperforms strong baselines on BRIGHT and RAR-b benchmarks.
RaDeR surpasses BM25 on Chain-of-Thought reasoning queries.
RaDeR achieves high performance with only 2.5% of the training data of REASONIR.
Abstract
We propose RaDeR, a set of reasoning-based dense retrieval models trained with data derived from mathematical problem solving using large language models (LLMs). Our method leverages retrieval-augmented reasoning trajectories of an LLM and self-reflective relevance evaluation, enabling the creation of both diverse and hard-negative samples for reasoning-intensive relevance. RaDeR retrievers, trained for mathematical reasoning, effectively generalize to diverse reasoning tasks in the BRIGHT and RAR-b benchmarks, consistently outperforming strong baselines in overall performance. Notably, RaDeR achieves significantly higher performance than baselines on the Math and Coding splits. In addition, RaDeR presents the first dense retriever that outperforms BM25 when queries are Chain-of-Thought reasoning steps, underscoring the critical role of reasoning-based retrieval to augment reasoning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Raderspace/RaDeR_Qwen_25_7B_instruct_MATH_LLMq_CoT_lexicalmodel· 13 dl13 dl
- 🤗Raderspace/RaDeR_gte_Qwen2-7B_MATH_LLMq_CoT_lexicalmodel· 2 dl2 dl
- 🤗Raderspace/RaDeR_Qwen25-7B_NuminaMath_MATH_allquerytypesmodel· 1 dl· ♡ 11 dl♡ 1
- 🤗Raderspace/RaDeR_Qwen25_3B_NuminaMath_MATH_allquerytypesmodel· 1 dl· ♡ 21 dl♡ 2
- 🤗Raderspace/RaDeR_Qwen25-14B_NuminaMath_MATH_allquerytypesmodel· 2 dl2 dl
Videos
Taxonomy
TopicsTopic Modeling · Neural Networks and Applications · Semantic Web and Ontologies
