Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?
Taiming Wang, Yuxia Zhang, Lin Jiang, Yi Tang, Guangjie Li, Hui Liu

TL;DR
This study evaluates state-of-the-art deep learning methods for detecting inconsistent method names in code, revealing significant performance drops in realistic scenarios and proposing improvements with contrastive learning and LLMs.
Contribution
The paper provides an empirical assessment of DL-based inconsistency detection methods using a new benchmark, highlighting their limitations and proposing directions for enhancement.
Findings
Performance drops on realistic benchmark
Retrieval-based methods excel on simple cases
Generation-based methods struggle with similarity and name generation
Abstract
Concise and meaningful method names are crucial for program comprehension and maintenance. However, method names may become inconsistent with their corresponding implementations, causing confusion and errors. Several deep learning (DL)-based approaches have been proposed to identify such inconsistencies, with initial evaluations showing promising results. However, these evaluations typically use a balanced dataset, where the number of inconsistent and consistent names are equal. This setup, along with flawed dataset construction, leads to false positives, making reported performance less reliable in real-world scenarios, where most method names are consistent. In this paper, we present an empirical study that evaluates state-of-the-art DL-based methods for identifying inconsistent method names. We create a new benchmark by combining automatic identification from commit histories and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
