Detecting Disguised Plagiarism

Hatem A. Mahmoud

arXiv:1711.02149·cs.OH·November 8, 2017

TL;DR

This paper introduces a preprocessor that normalizes source code to counteract various disguises used in source code plagiarism, enhancing the effectiveness of existing detection tools.

Contribution

It proposes a novel normalization preprocessor that improves the robustness of plagiarism detection against code disguises.

Findings

01. Preprocessor effectively neutralizes common disguises.

02. Enhanced detection accuracy with the normalization approach.

03. Applicable to existing plagiarism detection tools.

Abstract

Source code plagiarism detection is a problem that has been addressed several times before, and several tools have been developed for that purpose. In this research project, we investigated a set of possible disguises that can be mechanically applied to plagiarized source code to defeat plagiarism detection tools. We propose a preprocessor to be used with existing plagiarism detection tools to "normalize" source code before checking it, thus making such disguises ineffective.
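
To make the idea of normalizing code before comparison concrete, here is a minimal sketch. It is not the paper's preprocessor: this page does not list the disguises studied or the target language, so the sketch assumes two common disguises (inserted comments and renamed identifiers) and uses Python source purely for illustration. The function name `normalize` and the placeholder scheme `id0`, `id1`, ... are hypothetical.

```python
import io
import keyword
import tokenize
from typing import Dict, List

# Token types that carry no program meaning for this comparison.
_SKIPPED = {tokenize.COMMENT, tokenize.NL, tokenize.ENDMARKER}

# Layout tokens are replaced by fixed markers so that the amount of
# whitespace used for indentation does not matter.
_MARKERS = {
    tokenize.NEWLINE: "<NEWLINE>",
    tokenize.INDENT: "<INDENT>",
    tokenize.DEDENT: "<DEDENT>",
}


def normalize(source: str) -> List[str]:
    """Return a canonical token list for a piece of Python source.

    Assumed disguises (not taken from the paper): added comments and
    renamed identifiers. Every non-keyword name, including builtins,
    is replaced by a placeholder numbered by first appearance.
    """
    placeholders: Dict[str, str] = {}
    result: List[str] = []
    reader = io.StringIO(source).readline
    for tok in tokenize.generate_tokens(reader):
        if tok.type in _SKIPPED:
            continue
        if tok.type in _MARKERS:
            result.append(_MARKERS[tok.type])
        elif tok.type == tokenize.NAME and not keyword.iskeyword(tok.string):
            # Same original name -> same placeholder, so the usage
            # pattern of each identifier is preserved.
            if tok.string not in placeholders:
                placeholders[tok.string] = f"id{len(placeholders)}"
            result.append(placeholders[tok.string])
        else:
            result.append(tok.string)
    return result


original = (
    "def total(xs):\n"
    "    s = 0\n"
    "    for x in xs:\n"
    "        s += x\n"
    "    return s\n"
)
disguised = (
    "# helper\n"
    "def add_all(items):\n"
    "    acc = 0  # running sum\n"
    "    for it in items:\n"
    "        acc += it\n"
    "    return acc\n"
)
same = normalize(original) == normalize(disguised)
```

Under these assumptions, `same` is `True` because both snippets reduce to the same token list, which a downstream detection tool could then compare as usual. Disguises that change structure, such as reordering statements or swapping loop forms, would need further normalization steps that this sketch does not attempt.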

Taxonomy

Topics: Academic integrity and plagiarism · Software Engineering Research · Advanced Malware Detection Techniques