SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search

Xiaofeng Shi; Yuduo Li; Qian Kou; Longbin Yu; Jinxin Xie; Hua Zhou

arXiv:2507.15245·cs.IR·July 22, 2025

SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search

Xiaofeng Shi, Yuduo Li, Qian Kou, Longbin Yu, Jinxin Xie, Hua Zhou

PDF

1 Datasets

TL;DR

SPAR is a novel multi-agent framework utilizing LLMs for more flexible academic literature search, significantly improving retrieval performance and introducing a new benchmark for evaluation.

Contribution

It introduces SPAR, a multi-agent system with query decomposition and evolution, and SPARBench, a new benchmark with expert relevance labels, advancing scholarly retrieval methods.

Findings

01

SPAR outperforms baselines with up to +56% F1 on AutoScholar.

02

SPAR achieves up to +23% F1 on SPARBench.

03

The framework enhances search flexibility and effectiveness.

Abstract

Recent advances in large language models (LLMs) have opened new opportunities for academic literature retrieval. However, existing systems often rely on rigid pipelines and exhibit limited reasoning capabilities. We introduce SPAR, a multi-agent framework that incorporates RefChain-based query decomposition and query evolution to enable more flexible and effective search. To facilitate systematic evaluation, we also construct SPARBench, a challenging benchmark with expert-annotated relevance labels. Experimental results demonstrate that SPAR substantially outperforms strong baselines, achieving up to +56% F1 on AutoScholar and +23% F1 on SPARBench over the best-performing baseline. Together, SPAR and SPARBench provide a scalable, interpretable, and high-performing foundation for advancing research in scholarly retrieval. Code and data will be available at:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

MonteXiaofeng/SPARBench
dataset· 86 dl
86 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.