Qwen2 Technical Report
An Yang, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Zhou,, Chengpeng Li, Chengyuan Li, Dayiheng Liu, Fei Huang, Guanting Dong, Haoran, Wei, Huan Lin, Jialong Tang, Jialin Wang, Jian Yang, Jianhong Tu, Jianwei, Zhang, Jianxin Ma, Jianxin Yang, Jin Xu, Jingren Zhou

TL;DR
Qwen2 is a comprehensive suite of large language and multimodal models with up to 72 billion parameters, demonstrating state-of-the-art performance across diverse benchmarks and multilingual capabilities, and is openly accessible for community use.
Contribution
Introduction of the Qwen2 series, a new set of large language and multimodal models with extensive benchmarks, multilingual support, and open availability, advancing open-weight model performance.
Findings
Qwen2-72B achieves high scores on multiple benchmarks.
Qwen2 models outperform prior open-weight models.
Multilingual proficiency across 30 languages.
Abstract
This report introduces the Qwen2 series, the latest addition to our large language models and large multimodal models. We release a comprehensive suite of foundational and instruction-tuned language models, encompassing a parameter range from 0.5 to 72 billion, featuring dense models and a Mixture-of-Experts model. Qwen2 surpasses most prior open-weight models, including its predecessor Qwen1.5, and exhibits competitive performance relative to proprietary models across diverse benchmarks on language understanding, generation, multilingual proficiency, coding, mathematics, and reasoning. The flagship model, Qwen2-72B, showcases remarkable performance: 84.2 on MMLU, 37.9 on GPQA, 64.6 on HumanEval, 89.5 on GSM8K, and 82.4 on BBH as a base language model. The instruction-tuned variant, Qwen2-72B-Instruct, attains 9.1 on MT-Bench, 48.1 on Arena-Hard, and 35.7 on LiveCodeBench. Moreover,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Qwen/Qwen2.5-7B-Instructmodel· 16.1M dl· ♡ 117616.1M dl♡ 1176
- 🤗Qwen/Qwen2.5-3B-Instructmodel· 7.7M dl· ♡ 4297.7M dl♡ 429
- 🤗Qwen/Qwen2.5-Coder-7B-Instructmodel· 2.5M dl· ♡ 6762.5M dl♡ 676
- 🤗Qwen/Qwen2.5-1.5B-Instructmodel· 9.6M dl· ♡ 6519.6M dl♡ 651
- 🤗Qwen/Qwen2.5-1.5Bmodel· 753k dl· ♡ 173753k dl♡ 173
- 🤗Qwen/Qwen2.5-Coder-7B-Instruct-GGUFmodel· 102k dl· ♡ 220102k dl♡ 220
- 🤗Qwen/Qwen2.5-0.5Bmodel· 1.9M dl· ♡ 3881.9M dl♡ 388
- 🤗Qwen/Qwen2.5-0.5B-Instructmodel· 5.7M dl· ♡ 4895.7M dl♡ 489
- 🤗Qwen/Qwen2.5-32B-Instructmodel· 4.3M dl· ♡ 3424.3M dl♡ 342
- 🤗Qwen/Qwen2.5-72Bmodel· 58k dl· ♡ 9458k dl♡ 94
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsBalanced Selection
