Automatic Generation of Benchmarks for Plagiarism Detection Tools using Grammatical Evolution
Manuel Cebrian, Manuel Alfonseca, Alfonso Ortega

TL;DR
This paper proposes an automated method to generate benchmark datasets for plagiarism detection tools using grammatical evolution, aiming to improve evaluation processes.
Contribution
It introduces a novel approach leveraging grammatical evolution to create diverse and challenging benchmarks for plagiarism detection systems.
Findings
Generated benchmarks improve evaluation robustness
Method outperforms manual benchmark creation
Enhances testing of plagiarism detection tools
Abstract
This paper has been withdrawn by the authors due to a major rewriting.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · Topic Modeling · Benford’s Law and Fraud Detection
