Developing and Evaluating Tiny to Medium-Sized Turkish BERT Models
Himmet Toprak Kesgin, Muzaffer Kaan Yuce, Mehmet Fatih Amasyali

TL;DR
This paper develops and evaluates small to medium-sized Turkish BERT models trained on diverse data, demonstrating their effectiveness across multiple NLP tasks with high efficiency and robustness.
Contribution
Introduces and assesses new tiny to medium-sized Turkish BERT models, filling a research gap for less-resourced languages with comprehensive evaluation.
Findings
Models perform well on various NLP tasks.
Small models achieve competitive results.
Efficient and faster than larger counterparts.
Abstract
This study introduces and evaluates tiny, mini, small, and medium-sized uncased Turkish BERT models, aiming to bridge the research gap in less-resourced languages. We trained these models on a diverse dataset encompassing over 75GB of text from multiple sources and tested them on several tasks, including mask prediction, sentiment analysis, news classification, and, zero-shot classification. Despite their smaller size, our models exhibited robust performance, including zero-shot task, while ensuring computational efficiency and faster execution times. Our findings provide valuable insights into the development and application of smaller language models, especially in the context of the Turkish language.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ytu-ce-cosmos/turkish-tiny-bert-uncasedmodel· 88 dl· ♡ 788 dl♡ 7
- 🤗ytu-ce-cosmos/turkish-mini-bert-uncasedmodel· 12 dl· ♡ 912 dl♡ 9
- 🤗ytu-ce-cosmos/turkish-small-bert-uncasedmodel· 57 dl· ♡ 657 dl♡ 6
- 🤗ytu-ce-cosmos/turkish-medium-bert-uncasedmodel· 15 dl· ♡ 615 dl♡ 6
- 🤗ytu-ce-cosmos/turkish-base-bert-uncasedmodel· 342 dl· ♡ 22342 dl♡ 22
- 🤗ytu-ce-cosmos/turkish-base-bert-punctuation-correctionmodel· 8 dl· ♡ 148 dl♡ 14
- 🤗ytu-ce-cosmos/turkish-base-bert-capitalization-correctionmodel· 7 dl· ♡ 147 dl♡ 14
- 🤗ytu-ce-cosmos/turkish-large-bert-casedmodel· 28 dl· ♡ 628 dl♡ 6
- 🤗ytu-ce-cosmos/turkish-colbertmodel· 128 dl· ♡ 43128 dl♡ 43
- 🤗yazge/turkish-colbert-onnxmodel· 8 dl· ♡ 18 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Weight Decay · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · WordPiece · Adam · Attention Dropout
