On the Zero-Shot Generalization of Machine-Generated Text Detectors

Xiao Pu; Jingyu Zhang; Xiaochuang Han; Yulia Tsvetkov; Tianxing He

arXiv:2310.05165·cs.CL·October 10, 2023·1 cites

On the Zero-Shot Generalization of Machine-Generated Text Detectors

Xiao Pu, Jingyu Zhang, Xiaochuang Han, Yulia Tsvetkov, Tianxing He

PDF

Open Access

TL;DR

This paper investigates how machine-generated text detectors perform on unseen language models, finding that detectors trained on medium-sized models can effectively generalize to larger models in a zero-shot manner.

Contribution

It introduces a comprehensive evaluation of detector generalization across diverse language models and demonstrates the effectiveness of ensemble training on medium-sized models for robust detection.

Findings

01

Detectors trained on medium-sized models can zero-shot generalize to larger models.

02

Ensemble training on medium-sized models enhances detector robustness.

03

Detectors do not generalize well across all generators, highlighting the challenge of unseen model detection.

Abstract

The rampant proliferation of large language models, fluent enough to generate text indistinguishable from human-written language, gives unprecedented importance to the detection of machine-generated text. This work is motivated by an important research question: How will the detectors of machine-generated text perform on outputs of a new generator, that the detectors were not trained on? We begin by collecting generation data from a wide range of LLMs, and train neural detectors on data from each generator and test its performance on held-out generators. While none of the detectors can generalize to all generators, we observe a consistent and interesting pattern that the detectors trained on data from a medium-size LLM can zero-shot generalize to the larger version. As a concrete application, we demonstrate that robust detectors can be built on an ensemble of training data from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis

MethodsNone