Agentic Search in the Wild: Intents and Trajectory Dynamics from 14M+ Real Search Requests
Jingjie Ning, Jo\~ao Coelho, Yibo Kong, Yunfan Long, Bruno Martins, Jo\~ao Magalh\~aes, Jamie Callan, Chenyan Xiong

TL;DR
This study analyzes 14.44 million search requests to understand how agentic search sessions unfold, revealing behavioral patterns, evidence traceability, and implications for improving search agent strategies.
Contribution
It provides large-scale empirical insights into agentic search behaviors, introduces CTAR for evidence traceability, and releases a comprehensive dataset for future research.
Findings
Most sessions are short, with over 90% under ten steps.
Fact-seeking sessions show increasing repetition over time.
54% of new query terms are traceable to retrieved evidence.
Abstract
LLM-powered search agents are increasingly being used for multi-step information seeking tasks, yet the IR community lacks empirical understanding of how agentic search sessions unfold and how retrieved evidence is reflected in later queries. This paper presents a large-scale log analysis of agentic search based on 14.44M search requests (3.97M sessions) collected from DeepResearchGym, i.e., an open-source search API accessed by external agentic clients. We sessionize the logs, assign session-level intents and step-wise query-reformulation labels using LLM-based annotation, and propose Context-driven Term Adoption Rate (CTAR) to quantify whether newly introduced query terms are lexically traceable to previously retrieved evidence. Our analyses reveal distinctive behavioral patterns. First, over 90\% of multi-turn sessions contain at most ten steps, and 89\% of inter-step intervals fall…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
