MemReranker: Reasoning-Aware Reranking for Agent Memory Retrieval

Chunyu Li; Mengyuan Zhang; Jingyi Kang; Ding Chen; Jiajun Shen; Bo Tang; Xuanhe Zhou; Feiyu Xiong; Zhiyu Li

arXiv:2605.06132·cs.CL·May 15, 2026

MemReranker: Reasoning-Aware Reranking for Agent Memory Retrieval

Chunyu Li, Mengyuan Zhang, Jingyi Kang, Ding Chen, Jiajun Shen, Bo Tang, Xuanhe Zhou, Feiyu Xiong, Zhiyu Li

PDF

2 Models

TL;DR

MemReranker is a reasoning-aware reranking model for agent memory retrieval that improves relevance calibration and reasoning capabilities, outperforming existing models on benchmark tasks with lower latency.

Contribution

Introduces MemReranker, a multi-stage distillation-based reranker with enhanced reasoning, calibrated scoring, and strong performance on memory retrieval benchmarks.

Findings

01

MemReranker-0.6B outperforms BGE-Reranker and matches larger models.

02

MemReranker-4B achieves 0.737 MAP, comparable to Gemini-3-Flash.

03

Models maintain efficiency and generalize well across domains.

Abstract

In agent memory systems, the reranking model serves as the critical bridge connecting user queries with long-term memory. Most systems adopt the "retrieve-then-rerank" two-stage paradigm, but generic reranking models rely on semantic similarity matching and lack genuine reasoning capabilities, leading to a problem where recalled results are semantically highly relevant yet do not contain the key information needed to answer the question. This deficiency manifests in memory scenarios as three specific problems. First, relevance scores are miscalibrated, making threshold-based filtering difficult. Second, ranking degrades when facing temporal constraints, causal reasoning, and other complex queries. Third, the model cannot leverage dialogue context for semantic disambiguation. This report introduces MemReranker, a reranking model family (0.6B/4B) built on Qwen3-Reranker through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.