AgenticScholar: Agentic Data Management with Pipeline Orchestration for Scholarly Corpora
Hai Lan, Tingting Wang, Zhifeng Bao, Guoliang Li, Daomin Ji, Ge Lee, Feng Luo, Zi Huang, Hailang Qiu, and Gang Hua

TL;DR
AgenticScholar is a comprehensive system that unifies knowledge management, query planning, and execution to enable effective, efficient, and interpretable scholarly data analysis and reasoning at scale.
Contribution
It introduces a novel agentic data management system with integrated knowledge representation, hybrid query planning, and DAG-based execution for scholarly corpora.
Findings
Outperforms existing systems in effectiveness and efficiency
Provides interpretable end-to-end reasoning over scholarly data
Supports diverse scholarly queries from retrieval to knowledge discovery
Abstract
Managing the rapidly growing scholarly corpus poses significant challenges in representation, reasoning, and efficient analysis. An ideal system should unify structured knowledge management, agentic planning, and interpretable execution to support diverse scholarly queries - from retrieval to knowledge discovery and generation - at scale. Unfortunately, existing RAG and document analytics systems fail to achieve all query types simultaneously. To this end, we propose AgenticScholar, an agentic scholarly data management system that integrates a structure-aware knowledge representation layer, an LLM-centric hybrid query planning layer, and a unified execution layer with composable operators. AgenticScholar autonomously translates natural language queries into executable DAG plans, enabling end-to-end reasoning over multi-modal scholarly data. Extensive experiments demonstrate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Multi-Agent Systems and Negotiation · Topic Modeling
