The Effectiveness of Low-Level Structure-based Approach Toward Source   Code Plagiarism Level Taxonomy

Oscar Karnalim; Setia Budi

arXiv:1805.11035·cs.SE·May 29, 2018

The Effectiveness of Low-Level Structure-based Approach Toward Source Code Plagiarism Level Taxonomy

Oscar Karnalim, Setia Budi

PDF

TL;DR

This paper evaluates a low-level structure-based method for detecting source code plagiarism, demonstrating its effectiveness across various plagiarism levels and outperforming traditional token-based methods in real-world cases.

Contribution

It introduces an evaluation of the state-of-the-art low-level approach using real plagiarism data and confirms its superiority over baseline token-based methods across multiple plagiarism levels.

Findings

01

Effective in handling most plagiarism attacks

02

Outperforms baseline approach in most levels

03

Validated on real plagiarism cases

Abstract

Low-level approach is a novel way to detect source code plagiarism. Such approach is proven to be effective when compared to baseline approach (i.e., an approach which relies on source code token subsequence matching) in controlled environment. We evaluate the effectiveness of state of the art in low-level approach based on Faidhi \& Robinson's plagiarism level taxonomy; real plagiarism cases are employed as dataset in this work. Our evaluation shows that state of the art in low-level approach is effective to handle most plagiarism attacks. Further, it also outperforms its predecessor and baseline approach in most plagiarism levels.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.