TL;DR
PaSaMaster is a self-evolving, intent-aware literature retrieval system that improves accuracy, reduces hallucinations, and enhances cost efficiency by iterative search and relevance ranking.
Contribution
It introduces a novel iterative search process that evolves over time, separating intent understanding from retrieval to improve scientific literature search.
Findings
Outperforms traditional keyword retrieval with 15.6X F1-score improvement.
Reduces hallucination rates in literature retrieval to zero.
Achieves 30% better performance than GPT-5.2 at 1% of the computational cost.
Abstract
As large language models reshape scientific research, literature retrieval faces a twofold challenge: ensuring source authenticity while maintaining a deep comprehension of academic search intents. While reliable, traditional keyword-centric search fails to capture complex research intents. Frontier LLMs can handle complex research intents, but their high cost and tendency to hallucinate remain key limitations. Here we introduce PaSaMaster, a self-evolving agentic literature retrieval system that produces relevance-scored paper rankings with evidence-grounded recommendations through iterative intent analysis, retrieval, and ranking. It is built on three key designs. First, it transforms literature retrieval from a one shot query--document matching problem into a search process that evolves over time, using ranked evidence to reveal gaps, refine intents, and guide follow-up searches.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
