DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew
Shaltiel Shmidman, Avi Shmidman, Moshe Koppel

TL;DR
DictaBERT is a new state-of-the-art Hebrew BERT model that outperforms existing models and provides fine-tuned versions for key Hebrew NLP tasks, facilitating easier development and research.
Contribution
Introduction of DictaBERT, a powerful Hebrew BERT model, along with three fine-tuned models for prefix segmentation, morphological tagging, and question answering.
Findings
Outperforms existing Hebrew BERT models on benchmarks
Provides accessible models for Hebrew NLP tasks
Facilitates research and development in Hebrew language processing
Abstract
We present DictaBERT, a new state-of-the-art pre-trained BERT model for modern Hebrew, outperforming existing models on most benchmarks. Additionally, we release three fine-tuned versions of the model, designed to perform three specific foundational tasks in the analysis of Hebrew texts: prefix segmentation, morphological tagging and question answering. These fine-tuned models allow any developer to perform prefix segmentation, morphological tagging and question answering of a Hebrew input with a single call to a HuggingFace model, without the need to integrate any additional libraries or code. In this paper we describe the details of the training as well and the results on the different benchmarks. We release the models to the community, along with sample code demonstrating their use. We release these models as part of our goal to help further research and development in Hebrew NLP.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗dicta-il/mt5-xl-heqmodel· 2 dl2 dl
- 🤗dicta-il/dictabert-segmodel· 669 dl669 dl
- 🤗dicta-il/dictabertmodel· 1.7k dl· ♡ 121.7k dl♡ 12
- 🤗dicta-il/dictabert-morphmodel· 3.7k dl3.7k dl
- 🤗dicta-il/dictabert-heqmodel· 44 dl· ♡ 144 dl♡ 1
- 🤗dicta-il/dictabert-largemodel· 24 dl· ♡ 124 dl♡ 1
- 🤗dicta-il/dictabert-large-heqmodel· 20 dl20 dl
- 🤗dicta-il/dictabert-nermodel· 853 dl· ♡ 6853 dl♡ 6
- 🤗dicta-il/dictabert-large-nermodel· 1.5k dl1.5k dl
- 🤗dicta-il/dictabert-syntaxmodel· 71 dl· ♡ 171 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsAttention Is All You Need · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Linear Warmup With Linear Decay · Weight Decay · WordPiece · Layer Normalization · Linear Layer · Attention Dropout
