Plagiarism: Taxonomy, Tools and Detection Techniques
Hussain A Chowdhury, Dhruba K Bhattacharyya

TL;DR
This survey provides a comprehensive overview of plagiarism types, discusses various detection tools and techniques, especially machine learning methods, and highlights ongoing challenges in the field.
Contribution
It offers a detailed taxonomy of plagiarism forms and reviews recent machine learning-based detection methods along with their advantages and limitations.
Findings
Identifies key forms of plagiarism and their impact.
Analyzes machine learning techniques for detection.
Highlights research challenges and future directions.
Abstract
To detect plagiarism of any form, it is essential to have broad knowledge of its possible forms and classes, and existence of various tools and systems for its detection. Based on impact or severity of damages, plagiarism may occur in an article or in any production in a number of ways. This survey presents a taxonomy of various plagiarism forms and include discussion on each of these forms. Over the years, a good number tools and techniques have been introduced to detect plagiarism. This paper highlights few promising methods for plagiarism detection based on machine learning techniques. We analyse the pros and cons of these methods and finally we highlight a list of issues and research challenges related to this evolving research problem.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAcademic integrity and plagiarism · Topic Modeling · Authorship Attribution and Profiling
