TL;DR
This paper introduces a project-level encoder that enhances neural source code summarization models by incorporating contextual information from related files within a software project, leading to improved summarization performance.
Contribution
The paper proposes a novel project-level encoding technique that augments existing neural models for code summarization, addressing the limitation of models relying solely on individual code snippets.
Findings
Project-level encoding improves summarization accuracy.
Guidelines for balancing model performance and resource costs.
Enhanced models outperform baseline approaches.
Abstract
Source code summarization of a subroutine is the task of writing a short, natural language description of that subroutine. The description usually serves in documentation aimed at programmers, where even brief phrase (e.g. "compresses data to a zip file") can help readers rapidly comprehend what a subroutine does without resorting to reading the code itself. Techniques based on neural networks (and encoder-decoder model designs in particular) have established themselves as the state-of-the-art. Yet a problem widely recognized with these models is that they assume the information needed to create a summary is present within the code being summarized itself - an assumption which is at odds with program comprehension literature. Thus a current research frontier lies in the question of encoding source code context into neural models of summarization. In this paper, we present a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
