Agentic Multi-Source Grounding for Enhanced Query Intent Understanding: A DoorDash Case Study
Emmanuel Aboah Boateng, Kyle MacDonald, Akshad Viswanathan, Sudeep Das

TL;DR
This paper presents a grounded multi-source system for query intent understanding in marketplaces, combining catalog retrieval and web search to improve accuracy over traditional classifiers and LLMs.
Contribution
It introduces a flexible, domain-agnostic architecture that integrates multiple grounding sources and deterministic policies for disambiguation, enhancing query understanding at scale.
Findings
Achieved +10.9pp improvement over ungrounded LLM baseline.
Attained +4.6pp improvement over legacy system.
Reaches 90.7% accuracy on long-tail queries, serving over 95% of search impressions.
Abstract
Accurately mapping user queries to business categories is a fundamental Information Retrieval challenge for multi-category marketplaces, where context-sparse queries such as "Wildflower" exhibit intent ambiguity, simultaneously denoting a restaurant chain, a retail product, and a floral item. Traditional classifiers force a winner-takes-all assignment, while general-purpose LLMs hallucinate unavailable inventory. We introduce an Agentic Multi-Source Grounded system that addresses both failure modes by grounding LLM inference in (i) a staged catalog entity retrieval pipeline and (ii) an agentic web-search tool invoked autonomously for cold-start queries. Rather than predicting a single label, the model emits an ordered multi-intent set, resolved by a configurable disambiguation layer that applies deterministic business policies and is designed for extensibility to personalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
