Mining Legal Arguments to Study Judicial Formalism

Tom\'a\v{s} Koref; Lena Held; Mahammad Namazov; Harun Kumru; Yassine Thlija; Ivan Habernal

arXiv:2512.11374·cs.CL·March 24, 2026

Mining Legal Arguments to Study Judicial Formalism

Tom\'a\v{s} Koref, Lena Held, Mahammad Namazov, Harun Kumru, Yassine Thlija, Ivan Habernal

PDF

Open Access 3 Models

TL;DR

This paper develops automated NLP methods to analyze judicial reasoning and formalism in Czech Supreme Court decisions, creating datasets and models that improve understanding of legal argumentation and challenge existing narratives about formalism in Central and Eastern Europe.

Contribution

It introduces the MADON dataset, adapts transformer models for Czech legal texts, and presents a pipeline for classifying judicial reasoning and formalism with high accuracy.

Findings

01

Models detect argumentative paragraphs with 82.6% F1 score.

02

Legal argument classification achieves 77.5% F1 score.

03

Decision formalism classification reaches 83.8% F1 score.

Abstract

Courts must justify their decisions, but systematically analyzing judicial reasoning at scale remains difficult. This study tests claims about formalistic judging in Central and Eastern Europe (CEE) by developing automated methods to detect and classify judicial reasoning in decisions of Czech Supreme Courts using state-of-the-art natural language processing methods. We create the MADON dataset of 272 decisions from two Czech Supreme Courts with expert annotations of 9,183 paragraphs with eight argument types and holistic formalism labels for supervised training and evaluation. Using a corpus of 300,511 Czech court decisions, we adapt transformer LLMs to Czech legal domain through continued pretraining and we experiment with methods to address dataset imbalance including asymmetric loss and class weighting. The best models can detect argumentative paragraphs (82.6% Bal-F1), classify…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Legal Language and Interpretation · Multi-Agent Systems and Negotiation