Understanding Code Semantics: An Evaluation of Transformer Models in   Summarization

Debanjan Mondal; Abhilasha Lodha; Ankita Sahoo; Beena Kumari

arXiv:2310.16314·cs.LG·October 30, 2023·2 cites

Understanding Code Semantics: An Evaluation of Transformer Models in Summarization

Debanjan Mondal, Abhilasha Lodha, Ankita Sahoo, Beena Kumari

PDF

Open Access 1 Repo

TL;DR

This study evaluates whether transformer models genuinely understand code semantics in summarization tasks by testing their robustness against semantic alterations and adversarial code snippets across multiple programming languages.

Contribution

The paper introduces a comprehensive empirical evaluation of transformer models' understanding of code semantics, including adversarial code modifications, across Python, Javascript, and Java.

Findings

01

Models rely heavily on textual cues rather than true semantic understanding.

02

Adversarial code snippets significantly reduce model performance.

03

Transformer models show limited robustness to semantic alterations.

Abstract

This paper delves into the intricacies of code summarization using advanced transformer-based language models. Through empirical studies, we evaluate the efficacy of code summarization by altering function and variable names to explore whether models truly understand code semantics or merely rely on textual cues. We have also introduced adversaries like dead code and commented code across three programming languages (Python, Javascript, and Java) to further scrutinize the model's understanding. Ultimately, our research aims to offer valuable insights into the inner workings of transformer-based LMs, enhancing their ability to understand code and contributing to more efficient software development practices and maintenance workflows.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Demon702/robust_code_summary
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling · Computational Physics and Python Applications