Anatomy of a Query: W5H Dimensions and FAR Patterns for Text-to-SQL Evaluation
Vicki Stover Hertzberg, Eduardo Valverde, Joyce C. Ho

TL;DR
This paper introduces QUEST, a framework for evaluating text-to-SQL systems based on structural invariants and semantic dimensions, revealing domain-specific patterns and reasoning limitations.
Contribution
It formalizes the FAR and W5H frameworks for analyzing query semantics and validates their applicability across diverse datasets.
Findings
FAR conformance is universal across datasets and schemas.
Healthcare queries are concentrated in temporal and person-centric dimensions.
Causal and mechanistic reasoning are nearly absent in current datasets.
Abstract
Natural language interfaces to databases have gained popularity, yet the theoretical foundations for evaluating and designing these systems remain underdeveloped. We present QUEST (Query Understanding Evaluation through Semantic Translation), a framework resting on two independently motivated components: the FAR structural invariant, which holds that every well-formed query reduces to Filter, Aggregate, and Return operations; and the W5H dimensional framework, which holds that all filtering criteria map to six semantic dimensions (Who, What, Where, When, Why, and How). Validated across five text-to-SQL datasets (n = 120,464), FAR conformance is universal across all domains and schema types, while W5H dimensional profiles vary substantially. Healthcare queries are strongly concentrated in temporal (WHEN: 80.4%) and person-centric (WHO: 73.0%) dimensions far exceeding general-domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
