NorwAI's Large Language Models: Technical Report
Jon Atle Gulla, Peng Liu, Lemei Zhang

TL;DR
NorwAI developed a family of Norwegian and Scandinavian language models based on Transformer architectures, optimized through extensive training and fine-tuning, to improve NLP performance and accessibility for underrepresented languages.
Contribution
The paper introduces a new suite of Norwegian-focused large language models built on diverse Transformer architectures, with detailed training, fine-tuning, and deployment strategies, and open access for research.
Findings
Models demonstrate strong performance on real-world tasks.
Instruction-tuned variants show effective assistant capabilities.
Open availability supports further research and application.
Abstract
Norwegian, spoken by approximately five million people, remains underrepresented in many of the most significant breakthroughs in Natural Language Processing (NLP). To address this gap, the NorLLM team at NorwAI has developed a family of models specifically tailored to Norwegian and other Scandinavian languages, building on diverse Transformer-based architectures such as GPT, Mistral, Llama2, Mixtral and Magistral. These models are either pretrained from scratch or continually pretrained on 25B - 88.45B tokens, using a Norwegian-extended tokenizer and advanced post-training strategies to optimize performance, enhance robustness, and improve adaptability across various real-world tasks. Notably, instruction-tuned variants (e.g., Mistral-7B-Instruct and Mixtral-8x7B-Instruct) showcase strong assistant-style capabilities, underscoring their potential for practical deployment in interactive…
Code & Models
- 🤗 NorGLM/NorGPT-369M (model) · 1.2k downloads · ♡ 2
- 🤗 NorGLM/NorGPT-3B (model) · 7 downloads · ♡ 2
- 🤗 NorGLM/NorLlama-3B (model) · 12 downloads · ♡ 2
- 🤗 NorGLM/NorGPT-3B-continue (model) · 37 downloads
- 🤗 NorwAI/NorwAI-Llama2-7B (model) · 75 downloads · ♡ 9
- 🤗 NorwAI/NorwAI-Mistral-7B (model) · 113 downloads · ♡ 10
- 🤗 NorwAI/NorwAI-Mistral-7B-pretrain (model) · 102 downloads · ♡ 3
- 🤗 NorwAI/NorwAI-Mixtral-8x7B (model) · 174 downloads · ♡ 5
- 🤗 NorwAI/NorwAI-Mixtral-8x7B-instruct (model) · 179 downloads · ♡ 2
- 🤗 NorwAI/NorwAI-Mistral-7B-instruct (model) · 2.0k downloads · ♡ 10
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Artificial Intelligence in Healthcare and Education
