Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert and, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra and, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher and, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, and Jude Fernandes

TL;DR
Llama 2 is a set of large language models, including chat-optimized versions, that outperform existing open-source models in benchmarks and safety, aiming to be a responsible open alternative.
Contribution
We introduce Llama 2, a new open-source LLM family with fine-tuned chat models, and detail our approach to safety and fine-tuning for community use.
Findings
Llama 2 models outperform open-source chat models on most benchmarks.
Llama 2-Chat models show high helpfulness and safety in evaluations.
Our approach facilitates responsible development of LLMs.
Abstract
In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be a suitable substitute for closed-source models. We provide a detailed description of our approach to fine-tuning and safety improvements of Llama 2-Chat in order to enable the community to build on our work and contribute to the responsible development of LLMs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗meta-llama/Llama-2-7bmodel· 253 dl· ♡ 4467253 dl♡ 4467
- 🤗meta-llama/Llama-2-7b-hfmodel· 891k dl· ♡ 2287891k dl♡ 2287
- 🤗meta-llama/Llama-2-7b-chat-hfmodel· 453k dl· ♡ 4724453k dl♡ 4724
- 🤗meta-llama/Llama-2-7b-chatmodel· 61 dl· ♡ 61761 dl♡ 617
- 🤗meta-llama/Llama-2-13b-chat-hfmodel· 150k dl· ♡ 1113150k dl♡ 1113
- 🤗meta-llama/Llama-2-70b-chat-hfmodel· 22k dl· ♡ 220522k dl♡ 2205
- 🤗TheBloke/Llama-2-13B-chat-GGUFmodel· 7.0k dl· ♡ 2047.0k dl♡ 204
- 🤗meta-llama/Llama-2-13bmodel· 35 dl· ♡ 35235 dl♡ 352
- 🤗meta-llama/Llama-2-13b-chatmodel· 15 dl· ♡ 29615 dl♡ 296
- 🤗meta-llama/Llama-2-70bmodel· 24 dl· ♡ 53824 dl♡ 538
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsFeedforward Network · Grouped-query attention · Rotary Position Embedding · Root Mean Square Layer Normalization · AdamW · Absolute Position Encodings · Residual Connection · Label Smoothing · Dropout · Byte Pair Encoding
