Llama 2: Open Foundation and Fine-Tuned Chat Models

Hugo Touvron; Louis Martin; Kevin Stone; Peter Albert and; Amjad Almahairi; Yasmine Babaei; Nikolay Bashlykov; Soumya Batra and; Prajjwal Bhargava; Shruti Bhosale; Dan Bikel; Lukas Blecher and; Cristian Canton Ferrer; Moya Chen; Guillem Cucurull; David Esiobu; and Jude Fernandes; Jeremy Fu; Wenyin Fu; Brian Fuller; Cynthia; Gao; Vedanuj Goswami; Naman Goyal; Anthony Hartshorn; Saghar; Hosseini; Rui Hou; Hakan Inan; Marcin Kardas; Viktor Kerkez and; Madian Khabsa; Isabel Kloumann; Artem Korenev; Punit Singh Koura and; Marie-Anne Lachaux; Thibaut Lavril; Jenya Lee; Diana Liskovich and; Yinghai Lu; Yuning Mao; Xavier Martinet; Todor Mihaylov; Pushkar; Mishra; Igor Molybog; Yixin Nie; Andrew Poulton; Jeremy; Reizenstein; Rashi Rungta; Kalyan Saladi; Alan Schelten; Ruan; Silva; Eric Michael Smith; Ranjan Subramanian; Xiaoqing Ellen Tan; and Binh Tang; Ross Taylor; Adina Williams; Jian Xiang Kuan and; Puxin Xu; Zheng Yan; Iliyan Zarov; Yuchen Zhang; Angela Fan and; Melanie Kambadur; Sharan Narang; Aurelien Rodriguez; Robert Stojnic; and Sergey Edunov; Thomas Scialom

arXiv:2307.09288·cs.CL·July 20, 2023·2.6k cites

Llama 2: Open Foundation and Fine-Tuned Chat Models

Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert and, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra and, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher and, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, and Jude Fernandes

PDF

Open Access 5 Repos 10 Models 5 Datasets

TL;DR

Llama 2 is a set of large language models, including chat-optimized versions, that outperform existing open-source models in benchmarks and safety, aiming to be a responsible open alternative.

Contribution

We introduce Llama 2, a new open-source LLM family with fine-tuned chat models, and detail our approach to safety and fine-tuning for community use.

Findings

01

Llama 2 models outperform open-source chat models on most benchmarks.

02

Llama 2-Chat models show high helpfulness and safety in evaluations.

03

Our approach facilitates responsible development of LLMs.

Abstract

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be a suitable substitute for closed-source models. We provide a detailed description of our approach to fine-tuning and safety improvements of Llama 2-Chat in order to enable the community to build on our work and contribute to the responsible development of LLMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsFeedforward Network · Grouped-query attention · Rotary Position Embedding · Root Mean Square Layer Normalization · AdamW · Absolute Position Encodings · Residual Connection · Label Smoothing · Dropout · Byte Pair Encoding