Error Detection for Text-to-SQL Semantic Parsing
Shijie Chen, Ziru Chen, Huan Sun, Yu Su

TL;DR
This paper introduces a parser-independent error detection model for text-to-SQL semantic parsing that leverages code language models and graph neural networks to improve error identification and overall parser performance.
Contribution
We propose a novel error detection approach that is independent of specific parsers, utilizing language models and graph neural networks trained on realistic errors for better generalization.
Findings
Outperforms parser-dependent uncertainty metrics in error detection
Enhances the performance of various text-to-SQL parsers
Improves usability and trustworthiness of semantic parsing systems
Abstract
Despite remarkable progress in text-to-SQL semantic parsing in recent years, the performance of existing parsers is still far from perfect. Specifically, modern text-to-SQL parsers based on deep learning are often over-confident, thus casting doubt on their trustworthiness when deployed for real use. In this paper, we propose a parser-independent error detection model for text-to-SQL semantic parsing. Using a language model of code as its bedrock, we enhance our error detection model with graph neural networks that learn structural features of both natural language questions and SQL queries. We train our model on realistic parsing errors collected from a cross-domain setting, which leads to stronger generalization ability. Experiments with three strong text-to-SQL parsers featuring different decoding mechanisms show that our approach outperforms parser-dependent uncertainty metrics. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
