Towards Robustness of Text-to-SQL Models Against Natural and Realistic   Adversarial Table Perturbation

Xinyu Pi; Bing Wang; Yan Gao; Jiaqi Guo; Zhoujun Li; Jian-Guang Lou

arXiv:2212.09994·cs.CL·December 21, 2022

Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation

Xinyu Pi, Bing Wang, Yan Gao, Jiaqi Guo, Zhoujun Li, Jian-Guang Lou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new adversarial attack paradigm on tables for Text-to-SQL models, creates a benchmark to evaluate robustness, and proposes adversarial training to improve model resilience against table perturbations.

Contribution

It proposes Adversarial Table Perturbation (ATP), curates the ADVETA benchmark, and develops a systematic adversarial training framework to enhance robustness of Text-to-SQL models.

Findings

01

Models' performance drops significantly on ADVETA.

02

Adversarial training improves robustness against table perturbations.

03

The approach also enhances resilience to natural language perturbations.

Abstract

The robustness of Text-to-SQL parsers against adversarial perturbations plays a crucial role in delivering highly reliable applications. Previous studies along this line primarily focused on perturbations in the natural language question side, neglecting the variability of tables. Motivated by this, we propose the Adversarial Table Perturbation (ATP) as a new attacking paradigm to measure the robustness of Text-to-SQL models. Following this proposition, we curate ADVETA, the first robustness evaluation benchmark featuring natural and realistic ATPs. All tested state-of-the-art models experience dramatic performance drops on ADVETA, revealing models' vulnerability in real-world practices. To defend against ATP, we build a systematic adversarial training example generation framework tailored for better contextualization of tabular data. Experiments show that our approach not only brings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/ContextualSP
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning