Developing and Evaluating Tiny to Medium-Sized Turkish BERT Models

Himmet Toprak Kesgin; Muzaffer Kaan Yuce; Mehmet Fatih Amasyali

arXiv:2307.14134·cs.CL·July 27, 2023·2 cites

Developing and Evaluating Tiny to Medium-Sized Turkish BERT Models

Himmet Toprak Kesgin, Muzaffer Kaan Yuce, Mehmet Fatih Amasyali

PDF

Open Access 10 Models

TL;DR

This paper develops and evaluates small to medium-sized Turkish BERT models trained on diverse data, demonstrating their effectiveness across multiple NLP tasks with high efficiency and robustness.

Contribution

Introduces and assesses new tiny to medium-sized Turkish BERT models, filling a research gap for less-resourced languages with comprehensive evaluation.

Findings

01

Models perform well on various NLP tasks.

02

Small models achieve competitive results.

03

Efficient and faster than larger counterparts.

Abstract

This study introduces and evaluates tiny, mini, small, and medium-sized uncased Turkish BERT models, aiming to bridge the research gap in less-resourced languages. We trained these models on a diverse dataset encompassing over 75GB of text from multiple sources and tested them on several tasks, including mask prediction, sentiment analysis, news classification, and, zero-shot classification. Despite their smaller size, our models exhibited robust performance, including zero-shot task, while ensuring computational efficiency and faster execution times. Our findings provide valuable insights into the development and application of smaller language models, especially in the context of the Turkish language.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Weight Decay · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · WordPiece · Adam · Attention Dropout