Code Generation as a Dual Task of Code Summarization
Bolin Wei, Ge Li, Xin Xia, Zhiyi Fu, Zhi Jin

TL;DR
This paper introduces a dual training framework that leverages the intrinsic correlation between code summarization and code generation tasks, improving their performance through joint learning and regularization.
Contribution
It proposes a novel dual training approach that exploits the duality between CS and CG, which was not previously utilized, to enhance both tasks simultaneously.
Findings
Improved accuracy over baseline models on GitHub datasets
Effective use of duality regularization in training
Enhanced performance of code summarization and generation tasks
Abstract
Code summarization (CS) and code generation (CG) are two crucial tasks in the field of automatic software development. Various neural network-based approaches are proposed to solve these two tasks separately. However, there exists a specific intuitive correlation between CS and CG, which have not been exploited in previous work. In this paper, we apply the relations between two tasks to improve the performance of both tasks. In other words, exploiting the duality between the two tasks, we propose a dual training framework to train the two tasks simultaneously. In this framework, we consider the dualities on probability and attention weights, and design corresponding regularization terms to constrain the duality. We evaluate our approach on two datasets collected from GitHub, and experimental results show that our dual framework can improve the performance of CS and CG tasks over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling · Software Testing and Debugging Techniques
