Automated Variable Renaming: Are We There Yet?
Antonio Mastropaolo, Emad Aghajani, Luca Pascarella, Gabriele Bavota

TL;DR
This paper evaluates the effectiveness of data-driven techniques for automated variable renaming, demonstrating their potential and limitations through large-scale experiments with state-of-the-art models.
Contribution
It provides the first large-scale empirical study comparing three advanced data-driven models for automated variable renaming.
Findings
Models can provide valuable renaming suggestions under certain conditions.
Current approaches have notable limitations that require further research.
The study offers insights into integrating these techniques into refactoring tools.
Abstract
Identifiers, such as method and variable names, form a large portion of source code. Therefore, low-quality identifiers can substantially hinder code comprehension. To support developers in using meaningful identifiers, several (semi-)automatic techniques have been proposed, mostly being data-driven (e.g. statistical language models, deep learning models) or relying on static code analysis. Still, limited empirical investigations have been performed on the effectiveness of such techniques for recommending developers with meaningful identifiers, possibly resulting in rename refactoring operations. We present a large-scale study investigating the potential of data-driven approaches to support automated variable renaming. We experiment with three state-of-the-art techniques: a statistical language model and two DL-based models. The three approaches have been trained and tested on three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Advanced Malware Detection Techniques
