Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries
Jonathan F\"urst, Catherine Kosten, Farhad Nooralahzadeh, Yi Zhang,, Kurt Stockinger

TL;DR
This paper evaluates the robustness of Text-to-SQL systems using real user queries from a live deployment during the FIFA World Cup 2022, highlighting challenges and providing a new benchmark dataset for real-world scenarios.
Contribution
It presents the first systematic evaluation of Text-to-SQL systems' robustness across different data models in real-world conditions and introduces a new, challenging benchmark dataset.
Findings
Performance varies significantly across data models.
Training data size and processing steps impact system robustness.
The new benchmark dataset is more complex than previous ones.
Abstract
Text-to-SQL systems (also known as NL-to-SQL systems) have become an increasingly popular solution for bridging the gap between user capabilities and SQL-based data access. These systems translate user requests in natural language to valid SQL statements for a specific database. Recent Text-to-SQL systems have benefited from the rapid improvement of transformer-based language models. However, while Text-to-SQL systems that incorporate such models continuously reach new high scores on -- often synthetic -- benchmark datasets, a systematic exploration of their robustness towards different data models in a real-world, realistic scenario is notably missing. This paper provides the first in-depth evaluation of the data model robustness of Text-to-SQL systems in practice based on a multi-year international project focused on Text-to-SQL interfaces. Our evaluation is based on a real-world…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Advanced Database Systems and Queries
