Project-Level Encoding for Neural Source Code Summarization of   Subroutines

Aakash Bansal; Sakib Haque; Collin McMillan

arXiv:2103.11599·cs.SE·March 23, 2021

Project-Level Encoding for Neural Source Code Summarization of Subroutines

Aakash Bansal, Sakib Haque, Collin McMillan

PDF

1 Repo

TL;DR

This paper introduces a project-level encoder that enhances neural source code summarization models by incorporating contextual information from related files within a software project, leading to improved summarization performance.

Contribution

The paper proposes a novel project-level encoding technique that augments existing neural models for code summarization, addressing the limitation of models relying solely on individual code snippets.

Findings

01

Project-level encoding improves summarization accuracy.

02

Guidelines for balancing model performance and resource costs.

03

Enhanced models outperform baseline approaches.

Abstract

Source code summarization of a subroutine is the task of writing a short, natural language description of that subroutine. The description usually serves in documentation aimed at programmers, where even brief phrase (e.g. "compresses data to a zip file") can help readers rapidly comprehend what a subroutine does without resorting to reading the code itself. Techniques based on neural networks (and encoder-decoder model designs in particular) have established themselves as the state-of-the-art. Yet a problem widely recognized with these models is that they assume the information needed to create a summary is present within the code being summarized itself - an assumption which is at odds with program comprehension literature. Thus a current research frontier lies in the question of encoding source code context into neural models of summarization. In this paper, we present a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aakashba/projcon
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.