AraBERT: Transformer-based Model for Arabic Language Understanding
Wissam Antoun, Fady Baly, Hazem Hajj

TL;DR
This paper introduces AraBERT, a transformer-based language model specifically pre-trained for Arabic, achieving state-of-the-art results on various Arabic NLP tasks and providing resources to advance Arabic language understanding.
Contribution
The paper presents the first Arabic-specific BERT model pre-trained on a large corpus, outperforming multilingual BERT on multiple NLP tasks for Arabic.
Findings
AraBERT outperforms multilingual BERT on Arabic NLP tasks.
Pretrained AraBERT models are publicly available for research.
Achieved state-of-the-art results on several Arabic NLP benchmarks.
Abstract
The Arabic language is a morphologically rich language with relatively few resources and a less explored syntax compared to English. Given these limitations, Arabic Natural Language Processing (NLP) tasks like Sentiment Analysis (SA), Named Entity Recognition (NER), and Question Answering (QA), have proven to be very challenging to tackle. Recently, with the surge of transformers based models, language-specific BERT based models have proven to be very efficient at language understanding, provided they are pre-trained on a very large corpus. Such models were able to set new standards and achieve state-of-the-art results for most NLP tasks. In this paper, we pre-trained BERT specifically for the Arabic language in the pursuit of achieving the same success that BERT did for the English language. The performance of AraBERT is compared to multilingual BERT from Google and other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗aubmindlab/bert-base-arabertmodel· 871 dl· ♡ 33871 dl♡ 33
- 🤗aubmindlab/bert-base-arabertv01model· 40 dl· ♡ 340 dl♡ 3
- 🤗aubmindlab/bert-base-arabertv02-twittermodel· 11k dl· ♡ 811k dl♡ 8
- 🤗aubmindlab/bert-base-arabertv02model· 567k dl· ♡ 43567k dl♡ 43
- 🤗aubmindlab/bert-base-arabertv2model· 43k dl· ♡ 4243k dl♡ 42
- 🤗aubmindlab/bert-large-arabertv02-twittermodel· 235 dl· ♡ 4235 dl♡ 4
- 🤗aubmindlab/bert-large-arabertv02model· 567 dl· ♡ 10567 dl♡ 10
- 🤗aubmindlab/bert-large-arabertv2model· 1.3k dl· ♡ 121.3k dl♡ 12
- 🤗MaagDeveloper/aubmindlab-bert-base-arabertv02model
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax
