Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use
Justin Ho, Alexandra Colby, and William Fisher

TL;DR
This paper develops a specialized Retrieval-Augmented Generation system that integrates legal knowledge graphs and court citation networks to enhance legal reasoning and retrieval accuracy for copyright fair use analysis.
Contribution
It introduces a structured legal retrieval approach combining semantic search, legal knowledge graphs, and citation networks within RAG for improved legal reasoning.
Findings
Improved doctrinal relevance in legal retrieval.
Enhanced reasoning through Chain-of-Thought and citation weighting.
Foundation for future legal AI tools.
Abstract
This paper presents a domain-specific implementation of Retrieval-Augmented Generation (RAG) tailored to the Fair Use Doctrine in U.S. copyright law. Motivated by the increasing prevalence of DMCA takedowns and the lack of accessible legal support for content creators, we propose a structured approach that combines semantic search with legal knowledge graphs and court citation networks to improve retrieval quality and reasoning reliability. Our prototype models legal precedents at the statutory factor level (e.g., purpose, nature, amount, market effect) and incorporates citation-weighted graph representations to prioritize doctrinally authoritative sources. We use Chain-of-Thought reasoning and interleaved retrieval steps to better emulate legal reasoning. Preliminary testing suggests this method improves doctrinal relevance in the retrieval process, laying groundwork for future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCopyright and Intellectual Property · Digital Rights Management and Security
