A systematical procedure to extracting legal entities from Indonesian judicial decisions
Eka Qadri Nuranti, Naili Suri Intizhami, Evi Yulianti, A. Muh. Iqbal Latief, Osama Iyad Al Ghozy

TL;DR
The paper introduces a systematic method for extracting legal entities from Indonesian court decisions using named entity recognition, aiming to improve legal transparency and analysis.
Contribution
A structured pipeline for gathering, filtering, and annotating Indonesian judicial decisions with over 50 legal entity types using NER techniques.
Findings
A dataset of 2687 annotated court documents was created using manual annotation and inter-annotator agreement (Fleiss' Kappa score of 0.705).
The method supports downstream NLP tasks like legal information extraction and classification through token-level translation and BIO tagging.
Abstract
This article presents a systematic method of extracting legal entities from Indonesian judicial decisions with a well-structured named entity recognition (NER) approach. The procedure was implemented by gathering and annotating court decisions for theft cases at three court levels: first instance (2478 files), appeal (147 files), and cassation (62 files), amounting to 2687 annotated files. The data were harvested from the official website of the Supreme Court of the Republic of Indonesia using automated web scraping, followed by manual filtering for relevance and completeness. Manual annotation was performed with the Label Studio platform by three independent annotators. Annotation consistency was considered using Fleiss' Kappa, yielding an average agreement score of 0.705 across all levels, indicating good inter-annotator reliability. The method uses a hierarchical structure and a BIO…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Law, Economics, and Judicial Systems · Comparative and International Law Studies
