DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Xun Guo, Shan Zhang, Yongxin He, Ting Zhang, Wanquan Feng, Haibin, Huang, Chongyang Ma

TL;DR
DeTeCtive introduces a multi-level contrastive learning framework that improves AI-generated text detection by focusing on writing style differences, achieving state-of-the-art results and strong out-of-distribution performance.
Contribution
The paper presents a novel multi-task contrastive learning approach that enhances detection of AI-generated text and generalizes well to out-of-distribution data, surpassing existing methods.
Findings
Outperforms existing approaches in OOD zero-shot evaluation
Enhances detection capabilities across multiple benchmarks
Offers training-free incremental adaptation for OOD data
Abstract
Current techniques for detecting AI-generated text are largely confined to manual feature crafting and supervised binary classification paradigms. These methodologies typically lead to performance bottlenecks and unsatisfactory generalizability. Consequently, these methods are often inapplicable for out-of-distribution (OOD) data and newly emerged large language models (LLMs). In this paper, we revisit the task of AI-generated text detection. We argue that the key to accomplishing this task lies in distinguishing writing styles of different authors, rather than simply classifying the text into human-written or AI-generated text. To this end, we propose DeTeCtive, a multi-task auxiliary, multi-level contrastive learning framework. DeTeCtive is designed to facilitate the learning of distinct writing styles, combined with a dense information retrieval pipeline for AI-generated text…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling
MethodsContrastive Learning
