Plagiarism Detection Using Machine Learning
Omraj Kamat, Tridib Ghosh, Kalaivani J, Angayarkanni V, Rama P

TL;DR
This paper presents a machine learning-based plagiarism detection system that leverages natural language processing and classification algorithms to identify both exact and paraphrased plagiarism with high accuracy.
Contribution
It introduces a novel machine learning approach for plagiarism detection that improves upon traditional methods by effectively identifying disguised and paraphrased content.
Findings
High precision and recall in detecting plagiarism
Effective identification of paraphrased content
Enhanced scalability over traditional methods
Abstract
Plagiarism is an act of using someone else's work without proper acknowledgment, and this sin is seen to cut across various arenas including the academy, publishing, and other similar arenas. The traditional methods of plagiarism detection through keyword matching and review by humans usually fail to cope with increasingly sophisticated techniques used to mask copy pasted content. This paper aims to introduce a plagiarism detection approach based on machine learning that utilizes natural language processing and complex classification algorithms toward efficient detection of similarities between the documents. The developed model has the capability to detect both exact and paraphrased plagiarism accurately using advanced feature extraction techniques with supervised learning algorithms. We adapted and tested our model on an extensive text sample dataset. And we demonstrated some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAcademic integrity and plagiarism · Artificial Intelligence in Healthcare and Education
