Do Developers Refactor Data Access Code? An Empirical Study
Biruk Asmare Muse, Foutse Khomh, Giuliano Antoniol

TL;DR
This study empirically investigates refactoring practices in data access code within data-intensive systems, revealing prevalent refactoring types, their evolution over time, and their focus on code quality rather than SQL queries.
Contribution
It provides the first comprehensive empirical analysis of data access refactorings, highlighting their characteristics, evolution, and targeted functionalities in data-intensive systems.
Findings
Data access refactorings are common and vary in type.
Refactoring prevalence and types change as systems evolve.
Most refactorings target data fetching and insertion code.
Abstract
Developers often refactor code to improve the maintainability and comprehension of the software. There are many studies on refactoring activities in traditional software systems. However, refactoring in data-intensive systems is not well explored. Understanding the refactoring practices of developers is important to develop efficient tool support.We conducted a longitudinal study of refactoring activities in data access classes using 12 data-intensive subject systems. We investigated the prevalence and evolution of refactorings and the association of refactorings with data access smells. We also conducted a manual analysis of over 378 samples of data access refactoring instances to identify the functionalities of the code that are targeted by such refactorings. Our results show that (1) data access refactorings are prevalent and different in type. \textit{Rename variable} is the most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Data Quality and Management · Software System Performance and Reliability
