Domain Gating Ensemble Networks for AI-Generated Text Detection

Arihant Tripathi; Liam Dugan; Charis Gao; Maggie Huan; Emma Jin; Peter Zhang; David Zhang; Julia Zhao; Chris Callison-Burch

arXiv:2505.13855·cs.CL·May 21, 2025

Domain Gating Ensemble Networks for AI-Generated Text Detection

Arihant Tripathi, Liam Dugan, Charis Gao, Maggie Huan, Emma Jin, Peter Zhang, David Zhang, Julia Zhao, Chris Callison-Burch

PDF

Open Access

TL;DR

This paper introduces DoGEN, a domain gating ensemble method that enhances AI-generated text detection across unseen domains by combining expert models with domain classification, achieving state-of-the-art results.

Contribution

The paper proposes DoGEN, a novel ensemble approach that improves domain adaptation in AI-generated text detection, outperforming larger models on unseen domains.

Findings

01

Achieves state-of-the-art in-domain detection performance.

02

Outperforms larger models on out-of-domain detection.

03

Provides open-source code and models for future research.

Abstract

As state-of-the-art language models continue to improve, the need for robust detection of machine-generated text becomes increasingly critical. However, current state-of-the-art machine text detectors struggle to adapt to new unseen domains and generative models. In this paper we present DoGEN (Domain Gating Ensemble Networks), a technique that allows detectors to adapt to unseen domains by ensembling a set of domain expert detector models using weights from a domain classifier. We test DoGEN on a wide variety of domains from leading benchmarks and find that it achieves state-of-the-art performance on in-domain detection while outperforming models twice its size on out-of-domain detection. We release our code and trained models to assist in future research in domain-adaptive AI detection.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsSparse Evolutionary Training