TL;DR
DBTagger is a deep learning model that improves keyword mapping in NLIDBs by using multi-task learning with bi-directional RNNs, achieving high accuracy and scalability across multiple datasets.
Contribution
It introduces a schema-independent, end-to-end multi-task learning approach for keyword mapping in NLIDBs using bi-directional RNNs, outperforming existing methods.
Findings
Achieved an average accuracy of 92.4% on eight datasets.
DBTagger is up to 10,000 times faster than comparable methods.
Demonstrated scalability for larger databases.
Abstract
Translating Natural Language Queries (NLQs) to Structured Query Language (SQL) in interfaces deployed in relational databases is a challenging task, which has been widely studied in database community recently. Conventional rule based systems utilize series of solutions as a pipeline to deal with each step of this task, namely stop word filtering, tokenization, stemming/lemmatization, parsing, tagging, and translation. Recent works have mostly focused on the translation step overlooking the earlier steps by using ad-hoc solutions. In the pipeline, one of the most critical and challenging problems is keyword mapping; constructing a mapping between tokens in the query and relational database elements (tables, attributes, values, etc.). We define the keyword mapping problem as a sequence tagging problem, and propose a novel deep learning based supervised approach that utilizes POS tags of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
