Legal Extractive Summarization of U.S. Court Opinions

Emmanuel Bauer; Dominik Stammbach; Nianlong Gu; Elliott Ash

arXiv:2305.08428·cs.CL·May 16, 2023·6 cites

Legal Extractive Summarization of U.S. Court Opinions

Emmanuel Bauer, Dominik Stammbach, Nianlong Gu, Elliott Ash

PDF

Open Access 1 Repo

TL;DR

This paper presents a reinforcement learning-based extractive summarization model for U.S. court opinions, demonstrating superior automated and human-evaluated performance and open-sourcing the models to improve legal accessibility.

Contribution

Introduces MemSum, a reinforcement learning model for legal extractive summarization that outperforms transformer models and is publicly available.

Findings

01

MemSum outperforms transformer-based models in automated metrics.

02

Human evaluation confirms MemSum effectively captures key points.

03

Open-sourcing promotes legal accessibility and democratization.

Abstract

This paper tackles the task of legal extractive summarization using a dataset of 430K U.S. court opinions with key passages annotated. According to automated summary quality metrics, the reinforcement-learning-based MemSum model is best and even out-performs transformer-based models. In turn, expert human evaluation shows that MemSum summaries effectively capture the key points of lengthy court opinions. Motivated by these results, we open-source our models to the general public. This represents progress towards democratizing law and making U.S. court opinions more accessible to the general public.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bauerem/legal_memsum
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Legal Education and Practice Innovations · Natural Language Processing Techniques