Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via   Role Recognition and Involvement Measurement

Zihao Cheng; Li Zhou; Feng Jiang; Benyou Wang; Haizhou Li

arXiv:2410.14259·cs.CL·February 7, 2025

Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement

Zihao Cheng, Li Zhou, Feng Jiang, Benyou Wang, Haizhou Li

PDF

Open Access 1 Video

TL;DR

This paper introduces a new framework for detecting LLM-generated content by recognizing specific roles of LLMs and measuring their influence, moving beyond simple binary detection to more nuanced understanding.

Contribution

It proposes two novel tasks, LLM Role Recognition and Influence Measurement, along with a benchmark dataset and evaluation suite for more comprehensive detection of LLM-generated content.

Findings

01

Fine-tuned PLM models outperform others in detection tasks.

02

Advanced LLMs struggle to detect their own generated content.

03

The new tasks improve understanding of LLM involvement in content creation.

Abstract

The rapid development of large language models (LLMs), like ChatGPT, has resulted in the widespread presence of LLM-generated content on social media platforms, raising concerns about misinformation, data biases, and privacy violations, which can undermine trust in online discourse. While detecting LLM-generated content is crucial for mitigating these risks, current methods often focus on binary classification, failing to address the complexities of real-world scenarios like human-LLM collaboration. To move beyond binary classification and address these challenges, we propose a new paradigm for detecting LLM-generated content. This approach introduces two novel tasks: LLM Role Recognition (LLM-RR), a multi-class classification task that identifies specific roles of LLM in content generation, and LLM Influence Measurement (LLM-IM), a regression task that quantifies the extent of LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Translation Studies and Practices

MethodsFocus