AST-MHSA : Code Summarization using Multi-Head Self-Attention

Yeshwanth Nagaraj; Ujjwal Gupta

arXiv:2308.05646·cs.CL·August 11, 2023·1 cites

AST-MHSA : Code Summarization using Multi-Head Self-Attention

Yeshwanth Nagaraj, Ujjwal Gupta

PDF

Open Access

TL;DR

This paper introduces AST-MHSA, a novel transformer-based model that uses multi-head self-attention to efficiently extract semantic information from long ASTs for improved code summarization.

Contribution

The paper proposes a multi-head self-attention mechanism tailored for long ASTs, addressing computational challenges and enhancing semantic extraction in code summarization.

Findings

01

Improved summarization accuracy over baseline models

02

Reduced computational overhead compared to traditional transformer models

03

Effective extraction of semantic relations from long ASTs

Abstract

Code summarization aims to generate concise natural language descriptions for source code. The prevailing approaches adopt transformer-based encoder-decoder architectures, where the Abstract Syntax Tree (AST) of the source code is utilized for encoding structural information. However, ASTs are much longer than the corresponding source code, and existing methods ignore this size constraint by directly feeding the entire linearized AST into the encoders. This simplistic approach makes it challenging to extract truly valuable dependency relations from the overlong input sequence and leads to significant computational overhead due to self-attention applied to all nodes in the AST. To address this issue effectively and efficiently, we present a model, AST-MHSA that uses multi-head attention to extract the important semantic information from the AST. The model consists of two main…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Natural Language Processing Techniques · Web Data Mining and Analysis

MethodsLinear Layer · Softmax