A Survey of Classification Tasks and Approaches for Legal Contracts
Amrita Singh, Aditya Joshi, Jiaojiao Jiang, Hye-young Paik

TL;DR
This survey reviews the challenges, datasets, and methodologies for automatic legal contract classification, highlighting current approaches and future research directions to improve legal NLP applications.
Contribution
It provides the first comprehensive overview of classification tasks, datasets, and methods in legal contract classification, including a taxonomy of approaches.
Findings
Seven classification tasks identified in LCC
Fourteen datasets reviewed for English contracts
Transformer-based approaches show promising results
Abstract
Given the large size and volumes of contracts and their underlying inherent complexity, manual reviews become inefficient and prone to errors, creating a clear need for automation. Automatic Legal Contract Classification (LCC) revolutionizes the way legal contracts are analyzed, offering substantial improvements in speed, accuracy, and accessibility. This survey delves into the challenges of automatic LCC and a detailed examination of key tasks, datasets, and methodologies. We identify seven classification tasks within LCC, and review fourteen datasets related to English-language contracts, including public, proprietary, and non-public sources. We also introduce a methodology taxonomy for LCC, categorized into Traditional Machine Learning, Deep Learning, and Transformer-based approaches. Additionally, the survey discusses evaluation techniques and highlights the best-performing results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
