
TL;DR
This paper introduces a preprocessor that normalizes source code to counteract various disguises used in source code plagiarism, enhancing the effectiveness of existing detection tools.
Contribution
It proposes a novel normalization preprocessor that improves the robustness of plagiarism detection against code disguises.
Findings
The preprocessor neutralizes common, mechanically applied disguises.
Normalizing source code before checking improves detection accuracy.
The approach works with existing plagiarism detection tools.
Abstract
Source code plagiarism detection has been addressed several times before, and several tools have been developed for that purpose. In this research project, we investigated a set of possible disguises that can be mechanically applied to plagiarized source code to defeat plagiarism detection tools. We propose a preprocessor, to be used with existing plagiarism detection tools, that "normalizes" source code before it is checked, thus making such disguises ineffective.
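To make the idea of "normalizing" source code concrete, here is a minimal sketch in Python. It is not the paper's preprocessor: the abstract does not list which disguises are handled or which language is targeted, so the choices here (Python input, stripping comments and layout, renaming identifiers to placeholders in order of first appearance) and the function name `normalize` are assumptions made for illustration only.

```python
import io
import keyword
import tokenize

# Token types that carry only comments or layout; dropping them is an
# assumption of this sketch, not a step taken from the paper.
_SKIPPED = {
    tokenize.COMMENT, tokenize.NL, tokenize.NEWLINE,
    tokenize.INDENT, tokenize.DEDENT, tokenize.ENDMARKER,
}

def normalize(source: str) -> str:
    """Return a canonical token string for Python source (illustrative only)."""
    placeholders = {}  # original identifier -> placeholder such as "ID0"
    out = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type in _SKIPPED:
            continue
        if tok.type == tokenize.NAME and not keyword.iskeyword(tok.string):
            # Rename every non-keyword name in order of first appearance,
            # so renaming variables or functions does not change the output.
            out.append(placeholders.setdefault(tok.string, f"ID{len(placeholders)}"))
        else:
            out.append(tok.string)
    return " ".join(out)

original = "def total(xs):\n    s = 0\n    for x in xs:\n        s += x\n    return s\n"
disguised = "def f(items):\n    acc = 0   # accumulator\n    for it in items:\n        acc += it\n    return acc\n"

# Both versions reduce to the same normalized form despite renamed
# identifiers, an added comment, and changed spacing.
print(normalize(original) == normalize(disguised))
```

The normalized strings could then be passed to an existing detection tool in place of the raw files. Note that this sketch also renames built-in names and discards block structure, which a real preprocessor would likely treat more carefully.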
Taxonomy
Topics: Academic integrity and plagiarism · Software Engineering Research · Advanced Malware Detection Techniques
