TinyLlama: An Open-Source Small Language Model
Peiyuan Zhang, Guangtao Zeng, Tianduo Wang, Wei Lu

TL;DR
TinyLlama is a 1.1-billion-parameter open-source language model that, through efficient training and architecture, achieves strong performance on downstream tasks, surpassing similar-sized open-source models.
Contribution
It introduces TinyLlama, a small, efficient language model built on Llama 2 architecture, with improved training techniques and community-driven advances, outperforming comparable open-source models.
Findings
Outperforms existing open-source models of similar size
Achieves strong results on various downstream tasks
Demonstrates high computational efficiency
Abstract
We present TinyLlama, a compact 1.1B language model pretrained on around 1 trillion tokens for approximately 3 epochs. Building on the architecture and tokenizer of Llama 2, TinyLlama leverages various advances contributed by the open-source community (e.g., FlashAttention and Lit-GPT), achieving better computational efficiency. Despite its relatively small size, TinyLlama demonstrates remarkable performance in a series of downstream tasks. It significantly outperforms existing open-source language models with comparable sizes. Our model checkpoints and code are publicly available on GitHub at https://github.com/jzhang38/TinyLlama.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗TinyLlama/TinyLlama_v1.1model· 18k dl· ♡ 11118k dl♡ 111
- 🤗TinyLlama/TinyLlama_v1.1_math_codemodel· 944 dl· ♡ 14944 dl♡ 14
- 🤗TinyLlama/TinyLlama_v1.1_chinesemodel· 157 dl· ♡ 14157 dl♡ 14
- 🤗keeeeenw/MicroLlamamodel· 369 dl· ♡ 53369 dl♡ 53
- 🤗cortexso/tinyllamamodel· 98 dl· ♡ 198 dl♡ 1
- 🤗Kearm/TinyLlama_v1.1-SafeTensors-BF16model· 2 dl2 dl
- 🤗QuantFactory/TinyLlama_v1.1-GGUFmodel· 84 dl· ♡ 384 dl♡ 3
- 🤗RichardErkhov/keeeeenw_-_MicroLlama-ggufmodel· 32 dl32 dl
- 🤗MatteoKhan/TinyStable-Hybrid-1.6Bmodel· 5 dl5 dl
- 🤗cahlen/tinyllama-motorcycle-repair-qa-adaptermodel· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Algorithms and Data Compression
