Harnessing the Power of BERT in the Turkish Clinical Domain: Pretraining   Approaches for Limited Data Scenarios

Hazal T\"urkmen; O\u{g}uz Dikenelli; Cenk Eraslan; Mehmet Cem; \c{C}all{\i}; S\"uha S\"ureyya \"Ozbek

arXiv:2305.03788·cs.CL·May 9, 2023·1 cites

Harnessing the Power of BERT in the Turkish Clinical Domain: Pretraining Approaches for Limited Data Scenarios

Hazal T\"urkmen, O\u{g}uz Dikenelli, Cenk Eraslan, Mehmet Cem, \c{C}all{\i}, S\"uha S\"ureyya \"Ozbek

PDF

Open Access

TL;DR

This study investigates pretraining strategies for Turkish clinical language models using BERT, emphasizing the importance of domain-specific vocabulary and combining general and task-specific knowledge to improve performance in limited data scenarios.

Contribution

It introduces new pretraining approaches for Turkish clinical NLP, evaluates their effectiveness, and highlights the importance of vocabulary and combined training methods in low-resource settings.

Findings

01

General Turkish BERT and TurkRadBERT-task v1 perform best.

02

Task-adaptive pretraining can overfit with limited data.

03

Domain-specific vocabulary enhances model performance.

Abstract

In recent years, major advancements in natural language processing (NLP) have been driven by the emergence of large language models (LLMs), which have significantly revolutionized research and development within the field. Building upon this progress, our study delves into the effects of various pre-training methodologies on Turkish clinical language models' performance in a multi-label classification task involving radiology reports, with a focus on addressing the challenges posed by limited language resources. Additionally, we evaluated the simultaneous pretraining approach by utilizing limited clinical task data for the first time. We developed four models, including TurkRadBERT-task v1, TurkRadBERT-task v2, TurkRadBERT-sim v1, and TurkRadBERT-sim v2. Our findings indicate that the general Turkish BERT model (BERTurk) and TurkRadBERT-task v1, both of which utilize knowledge from a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Softmax · Layer Normalization · Linear Layer · WordPiece · Dropout · Weight Decay · Attention Dropout