Data Agnostic RoBERTa-based Natural Language to SQL Query Generation

Debaditya Pal; Harsh Sharma; Kaustubh Chaudhari

arXiv:2010.05243·cs.AI·March 8, 2021

Data Agnostic RoBERTa-based Natural Language to SQL Query Generation

Debaditya Pal, Harsh Sharma, Kaustubh Chaudhari

PDF

1 Repo

TL;DR

This paper presents a data privacy-focused NL2SQL model using RoBERTa embeddings and data-agnostic knowledge vectors, achieving 76.7% accuracy without relying on table data during training.

Contribution

It introduces a novel approach that eliminates the need for table data during training, enabling zero-shot learning for NL2SQL tasks.

Findings

01

Achieved 76.7% test set execution accuracy.

02

Eliminated dependency on table data during training.

03

Enabled zero-shot learning based on schema and question.

Abstract

Relational databases are among the most widely used architectures to store massive amounts of data in the modern world. However, there is a barrier between these databases and the average user. The user often lacks the knowledge of a query language such as SQL required to interact with the database. The NL2SQL task aims at finding deep learning approaches to solve this problem by converting natural language questions into valid SQL queries. Given the sensitive nature of some databases and the growing need for data privacy, we have presented an approach with data privacy at its core. We have passed RoBERTa embeddings and data-agnostic knowledge vectors into LSTM based submodels to predict the final query. Although we have not achieved state of the art results, we have eliminated the need for the table data, right from the training of the model, and have achieved a test set execution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DebadityaPal/RoBERTa-NL2SQL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Dense Connections · Tanh Activation · WordPiece · Multi-Head Attention · Layer Normalization · Sigmoid Activation · Linear Warmup With Linear Decay · Attention Dropout · Weight Decay