A Syntactic Neural Model for General-Purpose Code Generation

Pengcheng Yin; Graham Neubig

arXiv:1704.01696·cs.CL·April 7, 2017·115 cites

A Syntactic Neural Model for General-Purpose Code Generation

Pengcheng Yin, Graham Neubig

PDF

Open Access 5 Repos

TL;DR

This paper introduces a neural model that incorporates programming language syntax to improve the accuracy of translating natural language descriptions into source code, achieving state-of-the-art results.

Contribution

The paper presents a novel neural architecture that explicitly models syntax using a grammar, enhancing code generation from natural language.

Findings

01

Outperforms previous code generation methods

02

Effectively scales to complex programs

03

Achieves state-of-the-art accuracy

Abstract

We consider the problem of parsing natural language descriptions into source code written in a general-purpose programming language like Python. Existing data-driven methods treat this problem as a language generation task without considering the underlying syntax of the target programming language. Informed by previous work in semantic parsing, in this paper we propose a novel neural architecture powered by a grammar model to explicitly capture the target syntax as prior knowledge. Experiments find this an effective way to scale up to generation of complex programs from natural language descriptions, achieving state-of-the-art results that well outperform previous code generation and semantic parsing approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research