Predicting civil litigation outcomes and the evolution of case complexity and settlement dynamics
Sandro Claudio Lera, Shahrokh Firouzi, Jonathan Habshush, Robert Mahari

TL;DR
This study develops a temporal framework for predicting civil litigation outcomes using over 835,000 court filings, revealing how case complexity and information influence predictability and settlement dynamics.
Contribution
It introduces a novel sequential modeling approach that incorporates legal features and case heterogeneity to predict litigation outcomes and analyze complexity evolution.
Findings
Prediction accuracy reaches up to 97% for high-confidence plaintiff wins.
Rich information improves predictions mainly in low-complexity cases.
Case complexity increases over litigation, affecting settlement rates.
Abstract
Legal disputes unfold through sequences of filings in which parties update their positions and may settle at any stage. Most computational studies of legal prediction, however, focus on adjudicated outcomes and treat cases as static objects observed only at the end of litigation. Here we develop a temporally structured framework for predicting outcomes in civil litigation using 835,190 court filings between 1996 and 2022. We represent each case as a sequence of documents and model litigation as a three-outcome process: plaintiff win, plaintiff loss, or settlement. Documents are encoded using structured legal features, text embeddings, and information about judges and law firms, and a classifier estimates outcome probabilities at each stage of the case. The model achieves class-specific AUC values between 0.74 and 0.81, and reaches up to 97% accuracy for high-confidence plaintiff-win…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
