ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All   Tools

Team GLM: Aohan Zeng; Bin Xu; Bowen Wang; Chenhui Zhang; Da Yin; Dan; Zhang; Diego Rojas; Guanyu Feng; Hanlin Zhao; Hanyu Lai; Hao Yu; Hongning; Wang; Jiadai Sun; Jiajie Zhang; Jiale Cheng; Jiayi Gui; Jie Tang; Jing Zhang,; Jingyu Sun; Juanzi Li; Lei Zhao; Lindong Wu; Lucen Zhong; Mingdao Liu; Minlie; Huang; Peng Zhang; Qinkai Zheng; Rui Lu; Shuaiqi Duan; Shudan Zhang; Shulin; Cao; Shuxun Yang; Weng Lam Tam; Wenyi Zhao; Xiao Liu; Xiao Xia; Xiaohan; Zhang; Xiaotao Gu; Xin Lv; Xinghan Liu; Xinyi Liu; Xinyue Yang; Xixuan Song,; Xunkai Zhang; Yifan An; Yifan Xu; Yilin Niu; Yuantao Yang; Yueyan Li; Yushi; Bai; Yuxiao Dong; Zehan Qi; Zhaoyu Wang; Zhen Yang; Zhengxiao Du; Zhenyu Hou,; Zihan Wang

arXiv:2406.12793·cs.CL·July 31, 2024·176 cites

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Team GLM: Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Dan, Zhang, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning, Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang,, Jingyu Sun, Juanzi Li, Lei Zhao, Lindong Wu

PDF

Open Access 5 Repos 10 Models

TL;DR

ChatGLM is a family of large language models, with the GLM-4 series achieving performance comparable to or surpassing GPT-4 across various benchmarks, and includes tools for complex tasks like web browsing and code execution.

Contribution

This paper introduces the GLM-4 series of large language models, demonstrating state-of-the-art performance and multi-tool capabilities, with extensive open-source releases for community use.

Findings

01

GLM-4 models outperform GPT-4 on multiple benchmarks.

02

GLM-4 achieves near GPT-4-Turbo in instruction following.

03

Open-source models attract over 10 million downloads in 2023.

Abstract

We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained on ten trillions of tokens mostly in Chinese and English, along with a small set of corpus from 24 languages, and aligned primarily for Chinese and English usage. The high-quality alignment is achieved via a multi-stage post-training process, which involves supervised fine-tuning and learning from human feedback. Evaluations show that GLM-4 1) closely rivals or outperforms GPT-4 in terms of general metrics such as MMLU, GSM8K, MATH, BBH, GPQA, and HumanEval, 2) gets close to GPT-4-Turbo in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsSparse Evolutionary Training · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Attention Is All You Need · Linear Layer · Multi-Head Attention