DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive   Learning

Xun Guo; Shan Zhang; Yongxin He; Ting Zhang; Wanquan Feng; Haibin; Huang; Chongyang Ma

arXiv:2410.20964·cs.CL·October 29, 2024·3 cites

DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning

Xun Guo, Shan Zhang, Yongxin He, Ting Zhang, Wanquan Feng, Haibin, Huang, Chongyang Ma

PDF

Open Access 1 Repo 1 Models 1 Video

TL;DR

DeTeCtive introduces a multi-level contrastive learning framework that improves AI-generated text detection by focusing on writing style differences, achieving state-of-the-art results and strong out-of-distribution performance.

Contribution

The paper presents a novel multi-task contrastive learning approach that enhances detection of AI-generated text and generalizes well to out-of-distribution data, surpassing existing methods.

Findings

01

Outperforms existing approaches in OOD zero-shot evaluation

02

Enhances detection capabilities across multiple benchmarks

03

Offers training-free incremental adaptation for OOD data

Abstract

Current techniques for detecting AI-generated text are largely confined to manual feature crafting and supervised binary classification paradigms. These methodologies typically lead to performance bottlenecks and unsatisfactory generalizability. Consequently, these methods are often inapplicable for out-of-distribution (OOD) data and newly emerged large language models (LLMs). In this paper, we revisit the task of AI-generated text detection. We argue that the key to accomplishing this task lies in distinguishing writing styles of different authors, rather than simply classifying the text into human-written or AI-generated text. To this end, we propose DeTeCtive, a multi-task auxiliary, multi-level contrastive learning framework. DeTeCtive is designed to facilitate the learning of distinct writing styles, combined with a dense information retrieval pipeline for AI-generated text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

heyongxin233/detective
pytorchOfficial

Models

🤗
heyongxin233/DeTeCtive
model

Videos

DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning· slideslive

Taxonomy

TopicsTopic Modeling

MethodsContrastive Learning