Neural Machine Translation for Code Generation

Dharma KC; Clayton T. Morrison

arXiv:2305.13504·cs.CL·May 24, 2023·1 cites

Neural Machine Translation for Code Generation

Dharma KC, Clayton T. Morrison

PDF

Open Access

TL;DR

This paper surveys neural machine translation techniques applied to code generation, covering various input/output formats, architectures, and evaluation methods, highlighting current limitations and future research directions.

Contribution

It provides a comprehensive overview of NMT for code generation, categorizing methods and identifying gaps in existing research.

Findings

01

Diverse input/output representations have been explored in NMT for code.

02

Current methods face limitations in handling complex code generation tasks.

03

Future research should focus on improving model robustness and evaluation metrics.

Abstract

Neural machine translation (NMT) methods developed for natural language processing have been shown to be highly successful in automating translation from one natural language to another. Recently, these NMT methods have been adapted to the generation of program code. In NMT for code generation, the task is to generate output source code that satisfies constraints expressed in the input. In the literature, a variety of different input scenarios have been explored, including generating code based on natural language description, lower-level representations such as binary or assembly (neural decompilation), partial representations of source code (code completion and repair), and source code in another language (code translation). In this paper we survey the NMT for code generation literature, cataloging the variety of methods that have been explored according to input and output…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Natural Language Processing Techniques · Machine Learning and Data Classification