Introducing DictaLM -- A Large Generative Language Model for Modern   Hebrew

Shaltiel Shmidman; Avi Shmidman; Amir David Nissan Cohen; Moshe Koppel

arXiv:2309.14568·cs.CL·September 27, 2023

Introducing DictaLM -- A Large Generative Language Model for Modern Hebrew

Shaltiel Shmidman, Avi Shmidman, Amir David Nissan Cohen, Moshe Koppel

PDF

Open Access

TL;DR

DictaLM is a large-scale Hebrew language model with 7 billion parameters, designed to advance Hebrew NLP research and applications, including modern and Rabbinic Hebrew, by providing open foundation and instruct-tuned models.

Contribution

This paper introduces DictaLM, the first large Hebrew language model with open access, and a specialized Rabbinic Hebrew model, supporting diverse Hebrew NLP tasks.

Findings

01

Model achieves promising performance on Hebrew NLP tasks.

02

Open release fosters research and development in Hebrew language processing.

03

Supports multiple Hebrew dialects and historical forms.

Abstract

We present DictaLM, a large-scale language model tailored for Modern Hebrew. Boasting 7B parameters, this model is predominantly trained on Hebrew-centric data. As a commitment to promoting research and development in the Hebrew language, we release both the foundation model and the instruct-tuned model under a Creative Commons license. Concurrently, we introduce DictaLM-Rab, another foundation model geared towards Rabbinic/Historical Hebrew. These foundation models serve as ideal starting points for fine-tuning various Hebrew-specific tasks, such as instruction, Q&A, sentiment analysis, and more. This release represents a preliminary step, offering an initial Hebrew LLM model for the Hebrew NLP community to experiment with.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Handwritten Text Recognition Techniques