DeepRead: Document Structure-Aware Reasoning to Enhance Agentic Search

Zhanli Li; Huiwen Tian; Lvzhou Luo; Yixuan Cao; Ping Luo

arXiv:2602.05014·cs.AI·February 13, 2026

DeepRead: Document Structure-Aware Reasoning to Enhance Agentic Search

Zhanli Li, Huiwen Tian, Lvzhou Luo, Yixuan Cao, Ping Luo

PDF

Open Access

TL;DR

DeepRead introduces a structure-aware reasoning agent that leverages document hierarchy and sequential logic to improve multi-turn evidence gathering in large language models, outperforming traditional unstructured retrieval methods.

Contribution

It presents DeepRead, a novel document reasoning framework that incorporates native document structure into LLM-based agentic search, enabling more human-like, accurate document comprehension.

Findings

01

DeepRead outperforms baseline methods by 10.3% on four benchmarks.

02

The system effectively mimics human locate-then-read strategies.

03

Structural awareness enhances reasoning accuracy in document retrieval.

Abstract

With the rapid advancement of tool-use capabilities in Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) is shifting from static, one-shot retrieval toward autonomous, multi-turn evidence acquisition. However, existing agentic search frameworks typically treat long documents as flat collections of unstructured chunks, disregarding the native hierarchical organization and sequential logic essential for human comprehension. To bridge this gap, we introduce \textbf{DeepRead}, a structure-aware document reasoning agent designed to operationalize document-native structural priors into actionable reasoning capabilities. Leveraging the structural fidelity of modern OCR, DeepRead constructs a paragraph-level, coordinate-based navigation system and equips the LLM with two synergistic tools: \textsf{Retrieve} for scanning-aware localization, and \textsf{ReadSection} for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques