Structural Transfer Learning in NL-to-Bash Semantic Parsers

Kyle Duffy; Satwik Bhattamishra; Phil Blunsom

arXiv:2307.16795 · cs.CL · August 1, 2023

TL;DR

This paper investigates how structural overlap between tasks affects transfer learning, finding that NL-to-Bash semantic parsing is largely reducible to lexical alignment and that more pre-training compute does not always yield stronger transfer to it.

Contribution

It introduces a methodology to quantify structural overlap between NLP tasks and applies it to analyze transfer learning in NL-to-Bash semantic parsing.

Findings

01

Structural overlap between NL-to-Bash and NL-to-SQL is strong.

02

Lexical alignment largely explains transfer success.

03

More pre-training compute does not guarantee better transfer.

Abstract

Large-scale pre-training has made progress in many fields of natural language processing, though little is understood about the design of pre-training datasets. We propose a methodology for obtaining a quantitative understanding of structural overlap between machine translation tasks. We apply our methodology to the natural language to Bash semantic parsing task (NLBash) and show that it is largely reducible to lexical alignment. We also find that there is strong structural overlap between NLBash and natural language to SQL. Additionally, we perform a study varying compute expended during pre-training on the English to German machine translation task and find that more compute expended during pre-training does not always correspond to semantic representations with stronger transfer to NLBash.

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification