A Demonstration of SQLyzr: A Platform for Fine-Grained Text-to-SQL Evaluation and Analysis

Sepideh Abedini; M. Tamer \"Ozsu

arXiv:2604.21214·cs.DB·April 29, 2026

A Demonstration of SQLyzr: A Platform for Fine-Grained Text-to-SQL Evaluation and Analysis

Sepideh Abedini, M. Tamer \"Ozsu

PDF

1 Repo

TL;DR

SQLyzr is a comprehensive platform for evaluating text-to-SQL models, offering diverse metrics, realistic workload simulation, and detailed analysis tools to improve model performance.

Contribution

It introduces SQLyzr, a new benchmark platform with fine-grained evaluation, workload realism, and diagnostic features for text-to-SQL models.

Findings

01

Supports realistic SQL workload evaluation

02

Enables detailed error analysis and query classification

03

Provides an interactive interface for model assessment

Abstract

Text-to-SQL models have significantly improved with the adoption of Large Language Models (LLMs), leading to their increasing use in real-world applications. Although many benchmarks exist for evaluating the performance of text-to-SQL models, they often rely on a single aggregate score, lack evaluation under realistic settings, and provide limited insight into model behaviour across different query types. In this work, we present SQLyzr, a comprehensive benchmark and evaluation platform for text-to-SQL models. SQLyzr incorporates a diverse set of evaluation metrics that capture multiple aspects of generated queries, while enabling more realistic evaluation through workload alignment with real-world SQL usage patterns and database scaling. It further supports fine-grained query classification, error analysis, and workload augmentation, allowing users to better diagnose and improve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sepideh-abedini/SQLyzr
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.