verBERT: Automating Brazilian Case Law Document Multi-label   Categorization Using BERT

Felipe R. Serras; Marcelo Finger

arXiv:2203.06224·cs.LG·March 15, 2022

verBERT: Automating Brazilian Case Law Document Multi-label Categorization Using BERT

Felipe R. Serras, Marcelo Finger

PDF

1 Repo

TL;DR

This paper explores the use of BERT-based models to automate the multi-label categorization of Brazilian case law documents, achieving significant performance improvements over baseline methods.

Contribution

It introduces a multi-label BERT approach tailored for Brazilian legal documents, demonstrating its effectiveness with substantial F1-score gains.

Findings

01

Achieved micro-averaged F1-Score of 0.72

02

Gained 30 percentage points over baseline

03

Validated the approach on datasets from the Kollemata Project

Abstract

In this work, we carried out a study about the use of attention-based algorithms to automate the categorization of Brazilian case law documents. We used data from the Kollemata Project to produce two distinct datasets with adequate class systems. Then, we implemented a multi-class and multi-label version of BERT and fine-tuned different BERT models with the produced datasets. We evaluated several metrics, adopting the micro-averaged F1-Score as our main metric for which we obtained a performance value of F1-micro=0.72 corresponding to gains of 30 percent points over the tested statistical baseline. In this work, we carried out a study about the use of attention-based algorithms to automate the categorization of Brazilian case law documents. We used data from the \textit{Kollemata} Project to produce two distinct datasets with adequate class systems. Then, we implemented a multi-class…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

frserras/verbert-categorization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Residual Connection · Weight Decay · Layer Normalization · Linear Warmup With Linear Decay · WordPiece