LegalTurk Optimized BERT for Multi-Label Text Classification and NER

Farnaz Zeidi; Mehmet Fatih Amasyali; \c{C}i\u{g}dem Erol

arXiv:2407.00648·cs.CL·July 2, 2024

LegalTurk Optimized BERT for Multi-Label Text Classification and NER

Farnaz Zeidi, Mehmet Fatih Amasyali, \c{C}i\u{g}dem Erol

PDF

Open Access 2 Models

TL;DR

This paper introduces a modified pre-training approach for BERT tailored to the legal Turkish domain, improving performance in NER and multi-label classification tasks even with smaller training corpora.

Contribution

The study presents a novel pre-training method combining diverse masking strategies specifically for legal Turkish BERT models, enhancing downstream task performance.

Findings

01

Modified pre-training improves NER and classification accuracy

02

Models trained on smaller corpora outperform original BERT

03

Proposed approach is competitive with BERTurk despite less data

Abstract

The introduction of the Transformer neural network, along with techniques like self-supervised pre-training and transfer learning, has paved the way for advanced models like BERT. Despite BERT's impressive performance, opportunities for further enhancement exist. To our knowledge, most efforts are focusing on improving BERT's performance in English and in general domains, with no study specifically addressing the legal Turkish domain. Our study is primarily dedicated to enhancing the BERT model within the legal Turkish domain through modifications in the pre-training phase. In this work, we introduce our innovative modified pre-training approach by combining diverse masking strategies. In the fine-tuning task, we focus on two essential downstream tasks in the legal domain: name entity recognition and multi-label text classification. To evaluate our modified pre-training approach, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Weight Decay · Multi-Head Attention · WordPiece · Softmax · Layer Normalization · Focus · Byte Pair Encoding