Towards Reliable Detection of LLM-Generated Texts: A Comprehensive   Evaluation Framework with CUDRT

Zhen Tao; Yanfang Chen; Dinghao Xi; Zhiyu Li; Wei Xu

arXiv:2406.09056·cs.CL·December 18, 2024

Towards Reliable Detection of LLM-Generated Texts: A Comprehensive Evaluation Framework with CUDRT

Zhen Tao, Yanfang Chen, Dinghao Xi, Zhiyu Li, Wei Xu

PDF

Open Access 2 Repos

TL;DR

This paper introduces CUDRT, a comprehensive bilingual evaluation framework for detecting LLM-generated texts across diverse operations and languages, addressing limitations of existing benchmarks.

Contribution

It presents a novel, scalable evaluation framework with extensive datasets in Chinese and English, enabling in-depth analysis of detection performance across multiple LLM activities.

Findings

01

Framework improves detection reliability across languages.

02

Operational diversity impacts detection accuracy.

03

Multilingual training enhances cross-linguistic detection performance.

Abstract

The increasing prevalence of large language models (LLMs) has significantly advanced text generation, but the human-like quality of LLM outputs presents major challenges in reliably distinguishing between human-authored and LLM-generated texts. Existing detection benchmarks are constrained by their reliance on static datasets, scenario-specific tasks (e.g., question answering and text refinement), and a primary focus on English, overlooking the diverse linguistic and operational subtleties of LLMs. To address these gaps, we propose CUDRT, a comprehensive evaluation framework and bilingual benchmark in Chinese and English, categorizing LLM activities into five key operations: Create, Update, Delete, Rewrite, and Translate. CUDRT provides extensive datasets tailored to each operation, featuring outputs from state-of-the-art LLMs to assess the reliability of LLM-generated text detectors.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsFocus