Is Contrasting All You Need? Contrastive Learning for the Detection and   Attribution of AI-generated Text

Lucio La Cava; Davide Costa; Andrea Tagarelli

arXiv:2407.09364·cs.CL·March 18, 2025·2 cites

Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text

Lucio La Cava, Davide Costa, Andrea Tagarelli

PDF

Open Access

TL;DR

This paper introduces WhosAI, a contrastive learning framework that effectively detects and attributes AI-generated text by learning semantic similarities across multiple generators, outperforming existing methods on benchmark datasets.

Contribution

The paper presents a novel triplet-network contrastive learning approach that handles both detection and attribution tasks simultaneously and is scalable to new AI text-generation models.

Findings

01

Outperforms all methods on TuringBench benchmark

02

Effectively handles multiple AI generators simultaneously

03

Achieves state-of-the-art results in detection and attribution

Abstract

The significant progress in the development of Large Language Models has contributed to blurring the distinction between human and AI-generated text. The increasing pervasiveness of AI-generated text and the difficulty in detecting it poses new challenges for our society. In this paper, we tackle the problem of detecting and attributing AI-generated text by proposing WhosAI, a triplet-network contrastive learning framework designed to predict whether a given input text has been generated by humans or AI and to unveil the authorship of the text. Unlike most existing approaches, our proposed framework is conceived to learn semantic similarity representations from multiple generators at once, thus equally handling both detection and attribution tasks. Furthermore, WhosAI is model-agnostic and scalable to the release of new AI text-generation models by incorporating their generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsContrastive Learning