TL;DR
MemReranker is a reasoning-aware reranking model for agent memory retrieval that improves relevance calibration and reasoning capabilities, outperforming existing models on benchmark tasks with lower latency.
Contribution
Introduces MemReranker, a multi-stage distillation-based reranker with enhanced reasoning, calibrated scoring, and strong performance on memory retrieval benchmarks.
Findings
MemReranker-0.6B outperforms BGE-Reranker and matches larger models.
MemReranker-4B achieves 0.737 MAP, comparable to Gemini-3-Flash.
Models maintain efficiency and generalize well across domains.
Abstract
In agent memory systems, the reranking model serves as the critical bridge connecting user queries with long-term memory. Most systems adopt the "retrieve-then-rerank" two-stage paradigm, but generic reranking models rely on semantic similarity matching and lack genuine reasoning capabilities, leading to a problem where recalled results are semantically highly relevant yet do not contain the key information needed to answer the question. This deficiency manifests in memory scenarios as three specific problems. First, relevance scores are miscalibrated, making threshold-based filtering difficult. Second, ranking degrades when facing temporal constraints, causal reasoning, and other complex queries. Third, the model cannot leverage dialogue context for semantic disambiguation. This report introduces MemReranker, a reranking model family (0.6B/4B) built on Qwen3-Reranker through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
