A Survey of Query Optimization in Large Language Models
Mingyang Song, Mao Zheng

TL;DR
This survey systematically analyzes query optimization techniques in large language models, introducing a unified framework, taxonomy, and detailed analysis of atomic operations, while highlighting current challenges and future directions.
Contribution
It introduces the Query Optimization Lifecycle framework, a query complexity taxonomy, and provides an in-depth analysis of atomic query operations in LLMs.
Findings
Proposes the Query Optimization Lifecycle framework.
Classifies queries using a new taxonomy based on evidence type and quantity.
Analyzes four atomic query operations and discusses evaluation gaps.
Abstract
Query Optimization (QO) has become essential for enhancing Large Language Model (LLM) effectiveness, particularly in Retrieval-Augmented Generation (RAG) systems where query quality directly determines retrieval and response performance. This survey provides a systematic and comprehensive analysis of query optimization techniques with three principal contributions. \textit{First}, we introduce the \textbf{Query Optimization Lifecycle (QOL) Framework}, a five-phase pipeline covering Intent Recognition, Query Transformation, Retrieval Execution, Evidence Integration, and Response Synthesis, providing a unified lens for understanding the optimization process. \textit{Second}, we propose a \textbf{Query Complexity Taxonomy} that classifies queries along two dimensions, namely evidence type (explicit vs.\ implicit) and evidence quantity (single vs.\ multiple), establishing principled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Advanced Graph Neural Networks
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Residual Connection · Adam · Weight Decay · Multi-Head Attention · Layer Normalization · WordPiece · Dropout
