Exploring the Use of Text Classification in the Legal Domain

Octavia-Maria Sulea; Marcos Zampieri; Shervin Malmasi; Mihaela Vela,; Liviu P. Dinu; Josef van Genabith

arXiv:1710.09306·cs.CL·October 26, 2017·105 cites

Exploring the Use of Text Classification in the Legal Domain

Octavia-Maria Sulea, Marcos Zampieri, Shervin Malmasi, Mihaela Vela,, Liviu P. Dinu, Josef van Genabith

PDF

Open Access

TL;DR

This paper explores applying machine learning-based text classification to legal documents, achieving high accuracy in predicting case rulings, law areas, and ruling dates, thereby supporting legal professionals.

Contribution

It introduces an ensemble SVM approach and analyzes factors like case description masking and temporal influences on classification performance.

Findings

01

98% F1 score in ruling prediction

02

96% F1 score in law area classification

03

87.07% F1 score in ruling date estimation

Abstract

In this paper, we investigate the application of text classification methods to support law professionals. We present several experiments applying machine learning techniques to predict with high accuracy the ruling of the French Supreme Court and the law area to which a case belongs to. We also investigate the influence of the time period in which a ruling was made on the form of the case description and the extent to which we need to mask information in a full case ruling to automatically obtain training and test data that resembles case descriptions. We developed a mean probability ensemble system combining the output of multiple SVM classifiers. We report results of 98% average F1 score in predicting a case ruling, 96% F1 score for predicting the law area of a case, and 87.07% F1 score on estimating the date of a ruling.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Legal Education and Practice Innovations · Comparative and International Law Studies

MethodsSupport Vector Machine