Robust Deep Reinforcement Learning for Extractive Legal Summarization

Duy-Hung Nguyen; Bao-Sinh Nguyen; Nguyen Viet Dung Nghiem; Dung Tien; Le; Mim Amina Khatun; Minh-Tien Nguyen; and Hung Le

arXiv:2111.07158·cs.CL·April 14, 2022

Robust Deep Reinforcement Learning for Extractive Legal Summarization

Duy-Hung Nguyen, Bao-Sinh Nguyen, Nguyen Viet Dung Nghiem, Dung Tien, Le, Mim Amina Khatun, Minh-Tien Nguyen, and Hung Le

PDF

TL;DR

This paper introduces a reinforcement learning approach with novel reward functions to enhance deep summarization models for legal texts, achieving significant improvements across multiple datasets.

Contribution

It presents a reinforcement learning framework with new reward functions tailored for legal summarization, improving existing deep models' performance.

Findings

01

Significant performance gains on 3 legal datasets

02

Reinforcement learning outperforms traditional training methods

03

Novel reward functions effectively balance lexical and semantic quality

Abstract

Automatic summarization of legal texts is an important and still a challenging task since legal documents are often long and complicated with unusual structures and styles. Recent advances of deep models trained end-to-end with differentiable losses can well-summarize natural text, yet when applied to legal domain, they show limited results. In this paper, we propose to use reinforcement learning to train current deep summarization models to improve their performance on the legal domain. To this end, we adopt proximal policy optimization methods and introduce novel reward functions that encourage the generation of candidate summaries satisfying both lexical and semantic criteria. We apply our method to training different summarization backbones and observe a consistent and significant performance gain across 3 public legal datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.