A Survey of Classification Tasks and Approaches for Legal Contracts

Amrita Singh; Aditya Joshi; Jiaojiao Jiang; Hye-young Paik

arXiv:2507.21108 · cs.CL · July 30, 2025

TL;DR

This survey reviews the challenges, datasets, and methodologies for automatic legal contract classification, highlighting current approaches and future research directions to improve legal NLP applications.

Contribution

It provides the first comprehensive overview of classification tasks, datasets, and methods in legal contract classification, including a taxonomy of approaches.

Findings

01 Seven classification tasks identified in LCC

02 Fourteen datasets reviewed for English contracts

03 Transformer-based approaches show promising results

Abstract

Given the size and volume of contracts and their inherent complexity, manual review is inefficient and error-prone, creating a clear need for automation. Automatic Legal Contract Classification (LCC) revolutionizes the way legal contracts are analyzed, offering substantial improvements in speed, accuracy, and accessibility. This survey examines the challenges of automatic LCC along with its key tasks, datasets, and methodologies. We identify seven classification tasks within LCC and review fourteen datasets of English-language contracts, including public, proprietary, and non-public sources. We also introduce a methodology taxonomy for LCC, categorized into Traditional Machine Learning, Deep Learning, and Transformer-based approaches. Additionally, the survey discusses evaluation techniques and highlights the best-performing results…
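The abstract mentions evaluation techniques without naming them here. As a rough illustration only, the sketch below computes per-class and macro-averaged F1, a metric commonly used for multi-class text classification. Whether the survey reports this particular metric is an assumption, and the function name `macro_f1` and the clause labels are hypothetical, invented for this example.

```python
from collections import Counter


def macro_f1(gold, pred):
    """Return (per-class F1 dict, macro-averaged F1) for two equal-length label lists."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1   # correct prediction for class g
        else:
            fp[p] += 1   # p was predicted but is wrong
            fn[g] += 1   # g was the true class but was missed

    labels = sorted(set(gold) | set(pred))
    per_class = {}
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0.
        denom = 2 * tp[label] + fp[label] + fn[label]
        per_class[label] = 2 * tp[label] / denom if denom else 0.0

    # Macro average: every class counts equally, regardless of its frequency.
    macro = sum(per_class.values()) / len(labels) if labels else 0.0
    return per_class, macro


# Hypothetical clause labels, not taken from any dataset in the survey.
gold = ["termination", "indemnity", "termination", "governing_law"]
pred = ["termination", "termination", "termination", "governing_law"]
per_class, macro = macro_f1(gold, pred)
print(per_class, macro)
```

Macro averaging weights rare clause types as heavily as frequent ones, which can matter when contract datasets are imbalanced across classes.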
