Qwen2.5 Technical Report

Qwen: An Yang; Baosong Yang; Beichen Zhang; Binyuan Hui; Bo Zheng,; Bowen Yu; Chengyuan Li; Dayiheng Liu; Fei Huang; Haoran Wei; Huan Lin; Jian; Yang; Jianhong Tu; Jianwei Zhang; Jianxin Yang; Jiaxi Yang; Jingren Zhou,; Junyang Lin; Kai Dang; Keming Lu; Keqin Bao; Kexin Yang; Le Yu; Mei Li,; Mingfeng Xue; Pei Zhang; Qin Zhu; Rui Men; Runji Lin; Tianhao Li; Tianyi; Tang; Tingyu Xia; Xingzhang Ren; Xuancheng Ren; Yang Fan; Yang Su; Yichang; Zhang; Yu Wan; Yuqiong Liu; Zeyu Cui; Zhenru Zhang; Zihan Qiu (additional; authors not shown)

arXiv:2412.15115·cs.CL·January 6, 2025·68 cites

Qwen2.5 Technical Report

Qwen: An Yang, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng,, Bowen Yu, Chengyuan Li, Dayiheng Liu, Fei Huang, Haoran Wei, Huan Lin, Jian, Yang, Jianhong Tu, Jianwei Zhang, Jianxin Yang, Jiaxi Yang, Jingren Zhou,, Junyang Lin, Kai Dang, Keming Lu, Keqin Bao, Kexin Yang

PDF

Open Access 5 Repos 10 Models 2 Datasets

TL;DR

Qwen2.5 is a series of large language models with significant improvements in training data, fine-tuning, and reinforcement learning, achieving top-tier benchmark performance and supporting diverse applications including instruction tuning and specialized models.

Contribution

Introduction of Qwen2.5 models with scaled datasets, advanced post-training techniques, and multiple sizes, including proprietary MoE variants, demonstrating state-of-the-art performance and versatility.

Findings

01

Qwen2.5-72B-Instruct outperforms many open and proprietary models.

02

Models demonstrate competitive performance to larger models like Llama-3-405B.

03

Qwen2.5 variants are effective in training specialized models for math, coding, and multimodal tasks.

Abstract

In this report, we introduce Qwen2.5, a comprehensive series of large language models (LLMs) designed to meet diverse needs. Compared to previous iterations, Qwen 2.5 has been significantly improved during both the pre-training and post-training stages. In terms of pre-training, we have scaled the high-quality pre-training datasets from the previous 7 trillion tokens to 18 trillion tokens. This provides a strong foundation for common sense, expert knowledge, and reasoning capabilities. In terms of post-training, we implement intricate supervised finetuning with over 1 million samples, as well as multistage reinforcement learning. Post-training techniques enhance human preference, and notably improve long text generation, structural data analysis, and instruction following. To handle diverse and varied use cases effectively, we present Qwen2.5 LLM series in rich sizes. Open-weight…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Machine Learning and Data Classification

MethodsBalanced Selection