Deep Learning-Based Identification of Inconsistent Method Names: How Far   Are We?

Taiming Wang; Yuxia Zhang; Lin Jiang; Yi Tang; Guangjie Li; Hui Liu

arXiv:2501.12617·cs.SE·January 23, 2025

Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?

Taiming Wang, Yuxia Zhang, Lin Jiang, Yi Tang, Guangjie Li, Hui Liu

PDF

TL;DR

This study evaluates state-of-the-art deep learning methods for detecting inconsistent method names in code, revealing significant performance drops in realistic scenarios and proposing improvements with contrastive learning and LLMs.

Contribution

The paper provides an empirical assessment of DL-based inconsistency detection methods using a new benchmark, highlighting their limitations and proposing directions for enhancement.

Findings

01

Performance drops on realistic benchmark

02

Retrieval-based methods excel on simple cases

03

Generation-based methods struggle with similarity and name generation

Abstract

Concise and meaningful method names are crucial for program comprehension and maintenance. However, method names may become inconsistent with their corresponding implementations, causing confusion and errors. Several deep learning (DL)-based approaches have been proposed to identify such inconsistencies, with initial evaluations showing promising results. However, these evaluations typically use a balanced dataset, where the number of inconsistent and consistent names are equal. This setup, along with flawed dataset construction, leads to false positives, making reported performance less reliable in real-world scenarios, where most method names are consistent. In this paper, we present an empirical study that evaluates state-of-the-art DL-based methods for identifying inconsistent method names. We create a new benchmark by combining automatic identification from commit histories and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsContrastive Learning