Duplicated Code Pattern Mining in Visual Programming Languages

Miguel Terra-Neves; Jo\~ao Nadkarni; Miguel Ventura; Pedro Resende,; Hugo Veiga; Ant\'onio Alegria

arXiv:2107.07212·cs.SE·July 16, 2021

Duplicated Code Pattern Mining in Visual Programming Languages

Miguel Terra-Neves, Jo\~ao Nadkarni, Miguel Ventura, Pedro Resende,, Hugo Veiga, Ant\'onio Alegria

PDF

TL;DR

This paper introduces a scalable algorithm for detecting duplicated code patterns in Visual Programming Languages, addressing a gap in tools for maintaining code quality in VPL-based systems.

Contribution

It presents a novel pattern mining algorithm that leverages visual structures in VPLs to detect and explain duplicated code patterns, improving maintainability.

Findings

01

Duplication in OutSystems VPL codebases can reach 39%.

02

The proposed algorithm effectively detects duplicated code patterns.

03

Evaluation on real-world applications demonstrates scalability and accuracy.

Abstract

Visual Programming Languages (VPLs), coupled with the high-level abstractions that are commonplace in visual programming environments, enable users with less technical knowledge to become proficient programmers. However, the lower skill floor required by VPLs also entails that programmers are more likely to not adhere to best practices of software development, producing systems with high technical debt, and thus poor maintainability. Duplicated code is one important example of such technical debt. In fact, we observed that the amount of duplication in the OutSystems VPL code bases can reach as high as $39%$ . Duplicated code detection in text-based programming languages is still an active area of research with important implications regarding software maintainability and evolution. However, to the best of our knowledge, the literature on duplicated code detection for VPLs is very…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.