LegalTurk Optimized BERT for Multi-Label Text Classification and NER
Farnaz Zeidi, Mehmet Fatih Amasyali, \c{C}i\u{g}dem Erol

TL;DR
This paper introduces a modified pre-training approach for BERT tailored to the legal Turkish domain, improving performance in NER and multi-label classification tasks even with smaller training corpora.
Contribution
The study presents a novel pre-training method combining diverse masking strategies specifically for legal Turkish BERT models, enhancing downstream task performance.
Findings
Modified pre-training improves NER and classification accuracy
Models trained on smaller corpora outperform original BERT
Proposed approach is competitive with BERTurk despite less data
Abstract
The introduction of the Transformer neural network, along with techniques like self-supervised pre-training and transfer learning, has paved the way for advanced models like BERT. Despite BERT's impressive performance, opportunities for further enhancement exist. To our knowledge, most efforts are focusing on improving BERT's performance in English and in general domains, with no study specifically addressing the legal Turkish domain. Our study is primarily dedicated to enhancing the BERT model within the legal Turkish domain through modifications in the pre-training phase. In this work, we introduce our innovative modified pre-training approach by combining diverse masking strategies. In the fine-tuning task, we focus on two essential downstream tasks in the legal domain: name entity recognition and multi-label text classification. To evaluate our modified pre-training approach, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Weight Decay · Multi-Head Attention · WordPiece · Softmax · Layer Normalization · Focus · Byte Pair Encoding
