AI Text Detectors and the Misclassification of Slightly Polished Arabic Text
Saleh Almohaimeed, Saad Almohaimeed, Mousa Jari, Khaled A. Alobaid, Fahad Alotaibi

TL;DR
This study evaluates Arabic AI text detectors' ability to distinguish human-authored articles from AI-polished ones, revealing significant misclassification issues especially with slight polishing, which impacts credibility.
Contribution
It introduces two Arabic datasets and assesses the performance degradation of AI detectors when articles are slightly polished by LLMs.
Findings
All detectors misclassify many human articles as AI-generated.
Performance drops significantly for slightly polished texts.
Commercial models perform better but still face challenges.
Abstract
Many AI detection models have been developed to counter the presence of articles created by artificial intelligence (AI). However, if a human-authored article is slightly polished by AI, a shift will occur in the borderline decision of these AI detection models, leading them to consider it as AI-generated article. This misclassification may result in falsely accusing authors of AI plagiarism and harm the credibility of AI detectors. In English, some efforts were made to meet this challenge, but not in Arabic. In this paper, we generated two datasets. The first dataset contains 800 Arabic articles, half AI-generated and half human-authored. We used it to evaluate 14 Large Language models (LLMs) and commercial AI detectors to assess their ability in distinguishing between human-authored and AI-generated articles. The best 8 models were chosen to act as detectors for our primary concern,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAcademic integrity and plagiarism · Authorship Attribution and Profiling · Topic Modeling
