Agentic Search in the Wild: Intents and Trajectory Dynamics from 14M+ Real Search Requests

Jingjie Ning; Jo\~ao Coelho; Yibo Kong; Yunfan Long; Bruno Martins; Jo\~ao Magalh\~aes; Jamie Callan; Chenyan Xiong

arXiv:2601.17617·cs.IR·April 30, 2026

Agentic Search in the Wild: Intents and Trajectory Dynamics from 14M+ Real Search Requests

Jingjie Ning, Jo\~ao Coelho, Yibo Kong, Yunfan Long, Bruno Martins, Jo\~ao Magalh\~aes, Jamie Callan, Chenyan Xiong

PDF

1 Repo 2 Datasets

TL;DR

This study analyzes 14.44 million search requests to understand how agentic search sessions unfold, revealing behavioral patterns, evidence traceability, and implications for improving search agent strategies.

Contribution

It provides large-scale empirical insights into agentic search behaviors, introduces CTAR for evidence traceability, and releases a comprehensive dataset for future research.

Findings

01

Most sessions are short, with over 90% under ten steps.

02

Fact-seeking sessions show increasing repetition over time.

03

54% of new query terms are traceable to retrieved evidence.

Abstract

LLM-powered search agents are increasingly being used for multi-step information seeking tasks, yet the IR community lacks empirical understanding of how agentic search sessions unfold and how retrieved evidence is reflected in later queries. This paper presents a large-scale log analysis of agentic search based on 14.44M search requests (3.97M sessions) collected from DeepResearchGym, i.e., an open-source search API accessed by external agentic clients. We sessionize the logs, assign session-level intents and step-wise query-reformulation labels using LLM-based annotation, and propose Context-driven Term Adoption Rate (CTAR) to quantify whether newly introduced query terms are lexically traceable to previously retrieved evidence. Our analyses reveal distinctive behavioral patterns. First, over 90\% of multi-turn sessions contain at most ten steps, and 89\% of inter-step intervals fall…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://huggingface.co/datasets/cx-cmu/deepresearchgym-agentic-search-logs
github

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.