Hybrid Ranking Network for Text-to-SQL

Qin Lyu; Kaushik Chakrabarti; Shobhit Hathi; Souvik Kundu; Jianwen; Zhang; Zheng Chen

arXiv:2008.04759·cs.CL·August 12, 2020·50 cites

Hybrid Ranking Network for Text-to-SQL

Qin Lyu, Kaushik Chakrabarti, Shobhit Hathi, Souvik Kundu, Jianwen, Zhang, Zheng Chen

PDF

Open Access 2 Repos

TL;DR

This paper introduces HydraNet, a novel Text-to-SQL model that leverages pre-trained language models more effectively by focusing on column-wise ranking and decoding, leading to state-of-the-art results on WikiSQL.

Contribution

HydraNet breaks down Text-to-SQL into column-wise ranking and decoding, avoiding ad-hoc pooling and improving the utilization of pre-trained language models.

Findings

01

Achieves top performance on WikiSQL leaderboard.

02

Effectively leverages pre-trained models for Text-to-SQL.

03

Simplifies the decoding process with straightforward rules.

Abstract

In this paper, we study how to leverage pre-trained language models in Text-to-SQL. We argue that previous approaches under utilize the base language models by concatenating all columns together with the NL question and feeding them into the base language model in the encoding stage. We propose a neat approach called Hybrid Ranking Network (HydraNet) which breaks down the problem into column-wise ranking and decoding and finally assembles the column-wise outputs into a SQL query by straightforward rules. In this approach, the encoder is given a NL question and one individual column, which perfectly aligns with the original tasks BERT/RoBERTa is trained on, and hence we avoid any ad-hoc pooling or additional encoding layers which are necessary in prior approaches. Experiments on the WikiSQL dataset show that the proposed approach is very effective, achieving the top place on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Web Data Mining and Analysis · Data Management and Algorithms