M-DAIGT: A Shared Task on Multi-Domain Detection of AI-Generated Text

Salima Lamsiyah; Saad Ezzini; Abdelkader El Mahdaouy; Hamza Alami; Abdessamad Benlahbib; Samir El Amrany; Salmane Chafik; Hicham Hammouchi

arXiv:2511.11340·cs.CL·November 17, 2025

M-DAIGT: A Shared Task on Multi-Domain Detection of AI-Generated Text

Salima Lamsiyah, Saad Ezzini, Abdelkader El Mahdaouy, Hamza Alami, Abdessamad Benlahbib, Samir El Amrany, Salmane Chafik, Hicham Hammouchi

PDF

Open Access

TL;DR

This paper introduces M-DAIGT, a shared task and benchmark dataset for detecting AI-generated text across news and academic domains, highlighting current approaches and future challenges.

Contribution

It presents a new large-scale dataset and a shared task for multi-domain AI-generated text detection, fostering research in this critical area.

Findings

01

Four teams participated in the shared task.

02

The dataset includes 30,000 samples from various LLMs.

03

Methods varied across teams, indicating diverse approaches.

Abstract

The generation of highly fluent text by Large Language Models (LLMs) poses a significant challenge to information integrity and academic research. In this paper, we introduce the Multi-Domain Detection of AI-Generated Text (M-DAIGT) shared task, which focuses on detecting AI-generated text across multiple domains, particularly in news articles and academic writing. M-DAIGT comprises two binary classification subtasks: News Article Detection (NAD) (Subtask 1) and Academic Writing Detection (AWD) (Subtask 2). To support this task, we developed and released a new large-scale benchmark dataset of 30,000 samples, balanced between human-written and AI-generated texts. The AI-generated content was produced using a variety of modern LLMs (e.g., GPT-4, Claude) and diverse prompting strategies. A total of 46 unique teams registered for the shared task, of which four teams submitted final results.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Computational and Text Analysis Methods · Topic Modeling