Features Based Text Similarity Detection
Chow Kok Kent, Naomie Salim

TL;DR
This paper introduces a feature-enhanced fingerprint matching approach for plagiarism detection that efficiently identifies similar key sentences, reducing resource consumption while maintaining detection accuracy.
Contribution
It proposes a novel method combining key feature selection with fingerprint matching to improve efficiency in large content plagiarism detection.
Findings
Reduces time and space consumption in fingerprint matching
Maintains high effectiveness in detecting plagiarism
Selects key sentences to improve comparison efficiency
Abstract
As the Internet help us cross cultural border by providing different information, plagiarism issue is bound to arise. As a result, plagiarism detection becomes more demanding in overcoming this issue. Different plagiarism detection tools have been developed based on various detection techniques. Nowadays, fingerprint matching technique plays an important role in those detection tools. However, in handling some large content articles, there are some weaknesses in fingerprint matching technique especially in space and time consumption issue. In this paper, we propose a new approach to detect plagiarism which integrates the use of fingerprint matching technique with four key features to assist in the detection process. These proposed features are capable to choose the main point or key sentence in the articles to be compared. Those selected sentence will be undergo the fingerprint matching…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
