IntegrityAI at GenAI Detection Task 2: Detecting Machine-Generated   Academic Essays in English and Arabic Using ELECTRA and Stylometry

Mohammad AL-Smadi

arXiv:2501.05476·cs.CL·January 13, 2025

IntegrityAI at GenAI Detection Task 2: Detecting Machine-Generated Academic Essays in English and Arabic Using ELECTRA and Stylometry

Mohammad AL-Smadi

PDF

Open Access

TL;DR

This paper presents ELECTRA-based models with stylometric features for detecting machine-generated academic essays in English and Arabic, achieving top-tier F1-scores and competitive rankings in a benchmark challenge.

Contribution

It introduces language-specific ELECTRA models combined with stylometry for improved detection of AI-generated essays in multiple languages.

Findings

01

Achieved 99.7% F1-score in English detection

02

Achieved 98.4% F1-score in Arabic detection

03

Ranked 2nd in English and 1st in Arabic subtasks

Abstract

Recent research has investigated the problem of detecting machine-generated essays for academic purposes. To address this challenge, this research utilizes pre-trained, transformer-based models fine-tuned on Arabic and English academic essays with stylometric features. Custom models based on ELECTRA for English and AraELECTRA for Arabic were trained and evaluated using a benchmark dataset. Proposed models achieved excellent results with an F1-score of 99.7%, ranking 2nd among of 26 teams in the English subtask, and 98.4%, finishing 1st out of 23 teams in the Arabic one.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Academic integrity and plagiarism

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Dropout · Linear Layer · Weight Decay · Multi-Head Attention · Layer Normalization · Dense Connections · Attention Dropout · Softmax · Linear Warmup With Linear Decay