DeAR: Dual-Stage Document Reranking with Reasoning Agents via LLM Distillation
Abdelrahman Abdallah, Jamshid Mozafari, Bhawna Piryani, Adam Jatowt

TL;DR
DeAR introduces a dual-stage document reranking framework that distills relevance signals and employs listwise reasoning with natural-language justifications, achieving superior accuracy and interpretability over existing methods.
Contribution
It proposes a novel dual-stage approach with distillation and reasoning adapters, enhancing reranking performance and interpretability without extensive fine-tuning.
Findings
Outperforms open-source baselines on multiple datasets.
Achieves state-of-the-art results on TREC-DL19/20 and NovelEval.
Demonstrates effective open-domain QA performance.
Abstract
Large Language Models (LLMs) have transformed listwise document reranking by enabling global reasoning over candidate sets, yet single models often struggle to balance fine-grained relevance scoring with holistic cross-document analysis. We propose \textbf{De}ep\textbf{A}gent\textbf{R}ank (\textbf{\DeAR}), an open-source framework that decouples these tasks through a dual-stage approach, achieving superior accuracy and interpretability. In \emph{Stage 1}, we distill token-level relevance signals from a frozen 13B LLaMA teacher into a compact \{3, 8\}B student model using a hybrid of cross-entropy, RankNet, and KL divergence losses, ensuring robust pointwise scoring. In \emph{Stage 2}, we attach a second LoRA adapter and fine-tune on 20K GPT-4o-generated chain-of-thought permutations, enabling listwise reasoning with natural-language justifications. Evaluated on TREC-DL19/20, eight BEIR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗abdoelsayed/llama2-13b-rankllama-teachermodel· 1 dl1 dl
- 🤗abdoelsayed/dear-3b-reranker-ce-lora-v1model· 4 dl4 dl
- 🤗abdoelsayed/dear-3b-reranker-ce-v1model· 1 dl1 dl
- 🤗abdoelsayed/dear-3b-reranker-ranknet-lora-v1model· 1 dl1 dl
- 🤗abdoelsayed/dear-3b-reranker-ranknet-v1model· 10 dl10 dl
- 🤗abdoelsayed/dear-8b-reranker-ce-lora-v1model· 1 dl1 dl
- 🤗abdoelsayed/dear-8b-reranker-ranknet-lora-v1model· 2 dl2 dl
- 🤗abdoelsayed/dear-8b-reranker-ce-v1model· 2 dl2 dl
- 🤗abdoelsayed/dear-8b-reranker-ranknet-v1model· 1 dl1 dl
- 🤗abdoelsayed/dear-8b-reranker-listwise-lora-v1model· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
