Automatic Generation of Benchmarks for Plagiarism Detection Tools using   Grammatical Evolution

Manuel Cebrian; Manuel Alfonseca; Alfonso Ortega

arXiv:cs/0703134·cs.NE·January 7, 2008

Automatic Generation of Benchmarks for Plagiarism Detection Tools using Grammatical Evolution

Manuel Cebrian, Manuel Alfonseca, Alfonso Ortega

PDF

Open Access

TL;DR

This paper proposes an automated method to generate benchmark datasets for plagiarism detection tools using grammatical evolution, aiming to improve evaluation processes.

Contribution

It introduces a novel approach leveraging grammatical evolution to create diverse and challenging benchmarks for plagiarism detection systems.

Findings

01

Generated benchmarks improve evaluation robustness

02

Method outperforms manual benchmark creation

03

Enhances testing of plagiarism detection tools

Abstract

This paper has been withdrawn by the authors due to a major rewriting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms · Topic Modeling · Benford’s Law and Fraud Detection