TookaBERT: A Step Forward for Persian NLU

MohammadAli SadraeiJavaheri; Ali Moghaddaszadeh; Milad Molazadeh; and Fariba Naeiji; Farnaz Aghababaloo; Hamideh Rafiee; Zahra; Amirmahani; Tohid Abedini; Fatemeh Zahra Sheikhi; Amirmohammad; Salehoof

arXiv:2407.16382·cs.CL·July 24, 2024

TookaBERT: A Step Forward for Persian NLU

MohammadAli SadraeiJavaheri, Ali Moghaddaszadeh, Milad Molazadeh, and Fariba Naeiji, Farnaz Aghababaloo, Hamideh Rafiee, Zahra, Amirmahani, Tohid Abedini, Fatemeh Zahra Sheikhi, Amirmohammad, Salehoof

PDF

Open Access 2 Models

TL;DR

TookaBERT introduces two new Persian BERT models trained on Persian data, outperforming existing models on 14 NLU tasks with significant improvements, advancing Persian natural language understanding.

Contribution

The paper presents two novel Persian BERT models trained on Persian data, demonstrating superior performance over seven existing models across multiple NLU tasks.

Findings

01

Our larger model outperforms competitors by at least +2.8 points on average.

02

The models show strong effectiveness across diverse Persian NLU tasks.

03

TookaBERT advances Persian NLP with improved model performance.

Abstract

The field of natural language processing (NLP) has seen remarkable advancements, thanks to the power of deep learning and foundation models. Language models, and specifically BERT, have been key players in this progress. In this study, we trained and introduced two new BERT models using Persian data. We put our models to the test, comparing them to seven existing models across 14 diverse Persian natural language understanding (NLU) tasks. The results speak for themselves: our larger model outperforms the competition, showing an average improvement of at least +2.8 points. This highlights the effectiveness and potential of our new BERT models for Persian NLU tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Linear Warmup With Linear Decay · Dense Connections · Multi-Head Attention · Residual Connection · Dropout · WordPiece