Do LLMs Know to Respect Copyright Notice?

Jialiang Xu; Shenglan Li; Zhaozhuo Xu; Denghui Zhang

arXiv:2411.01136·cs.CL·November 5, 2024

Do LLMs Know to Respect Copyright Notice?

Jialiang Xu, Shenglan Li, Zhaozhuo Xu, Denghui Zhang

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether large language models respect copyright notices in user inputs, highlighting potential risks of copyright infringement and providing a benchmark dataset for future evaluation and alignment efforts.

Contribution

It introduces a comprehensive study on LLMs' respect for copyright notices and releases a benchmark dataset to evaluate infringement behaviors.

Findings

01

LLMs sometimes generate content that infringes copyright.

02

The study provides a conservative assessment of copyright infringement risk.

03

A benchmark dataset for evaluating LLMs' respect for copyright is released.

Abstract

Prior study shows that LLMs sometimes generate content that violates copyright. In this paper, we study another important yet underexplored problem, i.e., will LLMs respect copyright information in user input, and behave accordingly? The research problem is critical, as a negative answer would imply that LLMs will become the primary facilitator and accelerator of copyright infringement behavior. We conducted a series of experiments using a diverse set of language models, user prompts, and copyrighted materials, including books, news articles, API documentation, and movie scripts. Our study offers a conservative evaluation of the extent to which language models may infringe upon copyrights when processing user input containing protected material. This research emphasizes the need for further investigation and the importance of ensuring LLMs respect copyright regulations when handling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liamjxu/copyright
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLaw, AI, and Intellectual Property · Copyright and Intellectual Property · Legal Systems and Judicial Processes

MethodsSparse Evolutionary Training