DBTagger: Multi-Task Learning for Keyword Mapping in NLIDBs Using   Bi-Directional Recurrent Neural Networks

Arif Usta; Akifhan Karakayali; \"Ozg\"ur Ulusoy

arXiv:2101.04226·cs.DB·April 13, 2021

DBTagger: Multi-Task Learning for Keyword Mapping in NLIDBs Using Bi-Directional Recurrent Neural Networks

Arif Usta, Akifhan Karakayali, \"Ozg\"ur Ulusoy

PDF

1 Repo

TL;DR

DBTagger is a deep learning model that improves keyword mapping in NLIDBs by using multi-task learning with bi-directional RNNs, achieving high accuracy and scalability across multiple datasets.

Contribution

It introduces a schema-independent, end-to-end multi-task learning approach for keyword mapping in NLIDBs using bi-directional RNNs, outperforming existing methods.

Findings

01

Achieved an average accuracy of 92.4% on eight datasets.

02

DBTagger is up to 10,000 times faster than comparable methods.

03

Demonstrated scalability for larger databases.

Abstract

Translating Natural Language Queries (NLQs) to Structured Query Language (SQL) in interfaces deployed in relational databases is a challenging task, which has been widely studied in database community recently. Conventional rule based systems utilize series of solutions as a pipeline to deal with each step of this task, namely stop word filtering, tokenization, stemming/lemmatization, parsing, tagging, and translation. Recent works have mostly focused on the translation step overlooking the earlier steps by using ad-hoc solutions. In the pipeline, one of the most critical and challenging problems is keyword mapping; constructing a mapping between tokens in the query and relational database elements (tables, attributes, values, etc.). We define the keyword mapping problem as a sequence tagging problem, and propose a novel deep learning based supervised approach that utilizes POS tags of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

arifusta/DBTagger
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.