Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement
Zihao Cheng, Li Zhou, Feng Jiang, Benyou Wang, Haizhou Li

TL;DR
This paper introduces a new framework for detecting LLM-generated content by recognizing specific roles of LLMs and measuring their influence, moving beyond simple binary detection to more nuanced understanding.
Contribution
It proposes two novel tasks, LLM Role Recognition and Influence Measurement, along with a benchmark dataset and evaluation suite for more comprehensive detection of LLM-generated content.
Findings
Fine-tuned PLM models outperform others in detection tasks.
Advanced LLMs struggle to detect their own generated content.
The new tasks improve understanding of LLM involvement in content creation.
Abstract
The rapid development of large language models (LLMs), like ChatGPT, has resulted in the widespread presence of LLM-generated content on social media platforms, raising concerns about misinformation, data biases, and privacy violations, which can undermine trust in online discourse. While detecting LLM-generated content is crucial for mitigating these risks, current methods often focus on binary classification, failing to address the complexities of real-world scenarios like human-LLM collaboration. To move beyond binary classification and address these challenges, we propose a new paradigm for detecting LLM-generated content. This approach introduces two novel tasks: LLM Role Recognition (LLM-RR), a multi-class classification task that identifies specific roles of LLM in content generation, and LLM Influence Measurement (LLM-IM), a regression task that quantifies the extent of LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Translation Studies and Practices
MethodsFocus
