Automatic Anonymization of Swiss Federal Supreme Court Rulings

Joel Niklaus; Robin Mami\'e; Matthias St\"urmer; Daniel Brunner,; Marcel Gygli

arXiv:2310.04632·cs.CL·November 2, 2023·2 cites

Automatic Anonymization of Swiss Federal Supreme Court Rulings

Joel Niklaus, Robin Mami\'e, Matthias St\"urmer, Daniel Brunner,, Marcel Gygli

PDF

Open Access

TL;DR

This paper improves automatic anonymization of Swiss court rulings by integrating in-domain pre-trained BERT models with traditional methods, significantly increasing accuracy and reducing manual effort.

Contribution

It introduces an enhanced anonymization system that combines regex and machine learning, leveraging in-domain pre-training for better performance.

Findings

01

In-domain pre-training improves F1-score by over 5%.

02

Combining regex with machine learning reduces manual anonymization effort.

03

The approach outperforms existing models in accuracy.

Abstract

Releasing court decisions to the public relies on proper anonymization to protect all involved parties, where necessary. The Swiss Federal Supreme Court relies on an existing system that combines different traditional computational methods with human experts. In this work, we enhance the existing anonymization software using a large dataset annotated with entities to be anonymized. We compared BERT-based models with models pre-trained on in-domain data. Our results show that using in-domain data to pre-train the models further improves the F1-score by more than 5\% compared to existing models. Our work demonstrates that combining existing anonymization methods, such as regular expressions, with machine learning can further reduce manual labor and enhance automatic suggestions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Comparative and International Law Studies