PlagBench: Exploring the Duality of Large Language Models in Plagiarism   Generation and Detection

Jooyoung Lee; Toshini Agrawal; Adaku Uchendu; Thai Le; Jinghui Chen,; Dongwon Lee

arXiv:2406.16288·cs.CL·February 18, 2025·2 cites

PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection

Jooyoung Lee, Toshini Agrawal, Adaku Uchendu, Thai Le, Jinghui Chen,, Dongwon Lee

PDF

Open Access 1 Video

TL;DR

This paper introduces PlagBench, a large dataset of synthetic plagiarism examples generated by LLMs, to evaluate LLMs' abilities in both creating and detecting various types of plagiarism, revealing significant detection improvements with GPT-4.

Contribution

The paper provides a new dataset, PlagBench, for studying LLMs in plagiarism generation and detection, and offers comprehensive evaluation of LLMs' capabilities in these tasks.

Findings

01

GPT-3.5 Turbo generates high-quality paraphrases and summaries.

02

GPT-4 outperforms other LLMs and detection tools by 20%.

03

LLMs show evolving abilities in both content creation and plagiarism detection.

Abstract

Recent studies have raised concerns about the potential threats large language models (LLMs) pose to academic integrity and copyright protection. Yet, their investigation is predominantly focused on literal copies of original texts. Also, how LLMs can facilitate the detection of LLM-generated plagiarism remains largely unexplored. To address these gaps, we introduce \textbf{{\sf PlagBench}}, a dataset of 46.5K synthetic text pairs that represent three major types of plagiarism: verbatim copying, paraphrasing, and summarization. These samples are generated by three advanced LLMs. We rigorously validate the quality of PlagBench through a combination of fine-grained automatic evaluation and human annotation. We then utilize this dataset for two purposes: (1) to examine LLMs' ability to transform original content into accurate paraphrases and summaries, and (2) to evaluate the plagiarism…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection· underline

Taxonomy

TopicsAcademic integrity and plagiarism · Text Readability and Simplification · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Absolute Position Encodings · Label Smoothing · Cosine Annealing · Position-Wise Feed-Forward Layer · Linear Layer · Residual Connection · Multi-Head Attention