Neural Generation of Regular Expressions from Natural Language with Minimal Domain Knowledge

Nicholas Locascio; Karthik Narasimhan; Eduardo DeLeon; Nate Kushman; Regina Barzilay

arXiv:1608.03000 · cs.CL · August 11, 2016

TL;DR

This paper presents a neural approach to translating natural language queries into regular expressions without domain-specific crafting. Trained on a newly collected large parallel corpus, the model achieves a 19.6% performance gain over previous state-of-the-art models.

Contribution

The paper introduces a neural model that learns to generate regular expressions from natural language without domain-specific crafting, supported by a new large corpus of paired data.

Findings

01 Achieved a 19.6% performance gain over previous state-of-the-art models

02 Proposed a methodology for collecting a large corpus of regular expression, natural language pairs

03 Demonstrated that a neural model can learn to translate natural language into regular expressions directly from a parallel corpus, without domain-specific crafting

Abstract

This paper explores the task of translating natural language queries into regular expressions which embody their meaning. In contrast to prior work, the proposed neural model does not utilize domain-specific crafting, learning to translate directly from a parallel corpus. To fully explore the potential of neural models, we propose a methodology for collecting a large corpus of regular expression, natural language pairs. Our resulting model achieves a performance gain of 19.6% over previous state-of-the-art models.
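To make the task concrete, the sketch below shows what a natural language and regular expression pair might look like and how a generated regex could be spot-checked against a reference. The example pairs, the helper names `matches` and `agree_on`, and the sample-based comparison are all assumptions made for illustration; they are not taken from the paper's corpus or its evaluation procedure.

```python
import re

# Hypothetical (description, regex) pairs, invented for illustration only;
# they are not drawn from the paper's corpus.
PAIRS = [
    ("lines that contain a digit", r".*[0-9].*"),
    ("lines that start with a capital letter", r"[A-Z].*"),
]


def matches(pattern, line):
    """Return True if the whole line matches the pattern."""
    return re.fullmatch(pattern, line) is not None


def agree_on(pattern_a, pattern_b, samples):
    """Check that two regexes accept exactly the same sample lines.

    This is only a spot check on the given samples, not a proof that
    the two expressions are equivalent.
    """
    return all(matches(pattern_a, s) == matches(pattern_b, s) for s in samples)
```

A call such as `agree_on(r".*[0-9].*", r".*\d.*", ["a1", "b", "22"])` compares a reference regex with a differently written candidate. Comparing behavior rather than exact strings matters because the same meaning can be written as many different regular expressions.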

Code & Models

Datasets

inclinedadarsh/nl-to-regex
dataset · 10 dl
