Code Generation as a Dual Task of Code Summarization

Bolin Wei; Ge Li; Xin Xia; Zhiyi Fu; Zhi Jin

arXiv:1910.05923·cs.LG·October 16, 2019·64 cites

Code Generation as a Dual Task of Code Summarization

Bolin Wei, Ge Li, Xin Xia, Zhiyi Fu, Zhi Jin

PDF

Open Access 2 Repos

TL;DR

This paper introduces a dual training framework that leverages the intrinsic correlation between code summarization and code generation tasks, improving their performance through joint learning and regularization.

Contribution

It proposes a novel dual training approach that exploits the duality between CS and CG, which was not previously utilized, to enhance both tasks simultaneously.

Findings

01

Improved accuracy over baseline models on GitHub datasets

02

Effective use of duality regularization in training

03

Enhanced performance of code summarization and generation tasks

Abstract

Code summarization (CS) and code generation (CG) are two crucial tasks in the field of automatic software development. Various neural network-based approaches are proposed to solve these two tasks separately. However, there exists a specific intuitive correlation between CS and CG, which have not been exploited in previous work. In this paper, we apply the relations between two tasks to improve the performance of both tasks. In other words, exploiting the duality between the two tasks, we propose a dual training framework to train the two tasks simultaneously. In this framework, we consider the dualities on probability and attention weights, and design corresponding regularization terms to constrain the duality. We evaluate our approach on two datasets collected from GitHub, and experimental results show that our dual framework can improve the performance of CS and CG tasks over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling · Software Testing and Debugging Techniques