A Syntactic Neural Model for General-Purpose Code Generation
Pengcheng Yin, Graham Neubig

TL;DR
This paper introduces a neural model that incorporates programming language syntax to improve the accuracy of translating natural language descriptions into source code, achieving state-of-the-art results.
Contribution
The paper presents a novel neural architecture that explicitly models syntax using a grammar, enhancing code generation from natural language.
Findings
Outperforms previous code generation methods
Effectively scales to complex programs
Achieves state-of-the-art accuracy
Abstract
We consider the problem of parsing natural language descriptions into source code written in a general-purpose programming language like Python. Existing data-driven methods treat this problem as a language generation task without considering the underlying syntax of the target programming language. Informed by previous work in semantic parsing, in this paper we propose a novel neural architecture powered by a grammar model to explicitly capture the target syntax as prior knowledge. Experiments find this an effective way to scale up to generation of complex programs from natural language descriptions, achieving state-of-the-art results that well outperform previous code generation and semantic parsing approaches.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research
