Code Llama: Open Foundation Models for Code
Baptiste Rozi\`ere, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai, Gat, Xiaoqing Ellen Tan, Yossi Adi, Jingyu Liu, Romain Sauvestre, Tal Remez,, J\'er\'emy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish, Bhatt, Cristian Canton Ferrer, Aaron Grattafiori

TL;DR
Code Llama introduces a family of open-source large language models for code, offering state-of-the-art performance, infilling, large context support, and instruction-following capabilities across various sizes and specializations.
Contribution
The paper presents a new family of open models for code with improved performance, infilling, and large context handling, including Python-specific and instruction-following variants.
Findings
Achieves up to 67% on HumanEval benchmark.
Outperforms Llama 2 70B on key code tasks.
Outperforms all publicly available models on MultiPL-E.
Abstract
We release Code Llama, a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following ability for programming tasks. We provide multiple flavors to cover a wide range of applications: foundation models (Code Llama), Python specializations (Code Llama - Python), and instruction-following models (Code Llama - Instruct) with 7B, 13B, 34B and 70B parameters each. All models are trained on sequences of 16k tokens and show improvements on inputs with up to 100k tokens. 7B, 13B and 70B Code Llama and Code Llama - Instruct variants support infilling based on surrounding content. Code Llama reaches state-of-the-art performance among open models on several code benchmarks, with scores of up to 67% and 65% on HumanEval and MBPP, respectively. Notably,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗TheBloke/CodeLlama-7B-GGUFmodel· 5.2k dl· ♡ 1315.2k dl♡ 131
- 🤗TheBloke/CodeLlama-70B-Python-GGUFmodel· 1.1k dl· ♡ 451.1k dl♡ 45
- 🤗codellama/CodeLlama-7b-hfmodel· 86k dl· ♡ 37586k dl♡ 375
- 🤗codellama/CodeLlama-7b-Python-hfmodel· 4.6k dl· ♡ 1444.6k dl♡ 144
- 🤗codellama/CodeLlama-13b-hfmodel· 3.2k dl· ♡ 1183.2k dl♡ 118
- 🤗codellama/CodeLlama-13b-Python-hfmodel· 1.0k dl· ♡ 511.0k dl♡ 51
- 🤗codellama/CodeLlama-7b-Instruct-hfmodel· 258k dl· ♡ 255258k dl♡ 255
- 🤗codellama/CodeLlama-13b-Instruct-hfmodel· 8.9k dl· ♡ 1598.9k dl♡ 159
- 🤗codellama/CodeLlama-34b-hfmodel· 2.9k dl· ♡ 1782.9k dl♡ 178
- 🤗codellama/CodeLlama-34b-Python-hfmodel· 1.1k dl· ♡ 991.1k dl♡ 99
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Advanced Database Systems and Queries · Semantic Web and Ontologies
