GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
GLM-4.5 Team: Aohan Zeng, Xin Lv, Qinkai Zheng, Zhenyu Hou, Bin Chen, Chengxing Xie, Cunxiang Wang, Da Yin, Hao Zeng, Jiajie Zhang, Kedong Wang, Lucen Zhong, Mingdao Liu, Rui Lu, Shulin Cao, Xiaohan Zhang, Xuancheng Huang, Yao Wei, Yean Cheng, Yifan An, Yilin Niu, Yuanhao Wen

TL;DR
GLM-4.5 is a large, open-source Mixture-of-Experts language model with hybrid reasoning capabilities, achieving high performance on reasoning, agentic, and coding benchmarks with fewer parameters than competitors.
Contribution
Introduction of GLM-4.5, a 355B parameter MoE model with hybrid reasoning, trained on 23T tokens, and released alongside a smaller version to advance agentic AI research.
Findings
Achieves 70.1% on TAU-Bench
Scores 91.0% on AIME 24
Releases both large and compact models
Abstract
We present GLM-4.5, an open-source Mixture-of-Experts (MoE) large language model with 355B total parameters and 32B activated parameters, featuring a hybrid reasoning method that supports both thinking and direct response modes. Through multi-stage training on 23T tokens and comprehensive post-training with expert model iteration and reinforcement learning, GLM-4.5 achieves strong performance across agentic, reasoning, and coding (ARC) tasks, scoring 70.1% on TAU-Bench, 91.0% on AIME 24, and 64.2% on SWE-bench Verified. With much fewer parameters than several competitors, GLM-4.5 ranks 3rd overall among all evaluated models and 2nd on agentic benchmarks. We release both GLM-4.5 (355B parameters) and a compact version, GLM-4.5-Air (106B parameters), to advance research in reasoning and agentic AI systems. Code, models, and more information are available at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗zai-org/GLM-4.7-Flashmodel· 1.2M dl· ♡ 16441.2M dl♡ 1644
- 🤗unsloth/GLM-4.7-Flash-GGUFmodel· 246k dl· ♡ 586246k dl♡ 586
- 🤗ArliAI/GLM-4.6-Derestricted-v3model· 2.6k dl· ♡ 512.6k dl♡ 51
- 🤗zai-org/GLM-4.7model· 140k dl· ♡ 1949140k dl♡ 1949
- 🤗zai-org/GLM-4.5-Airmodel· 423k dl· ♡ 593423k dl♡ 593
- 🤗zai-org/GLM-4.6model· 22k dl· ♡ 120822k dl♡ 1208
- 🤗zai-org/GLM-4.5-Air-FP8model· 28k dl· ♡ 8028k dl♡ 80
- 🤗ArliAI/GLM-4.5-Air-Derestrictedmodel· 210 dl· ♡ 97210 dl♡ 97
- 🤗zai-org/GLM-4.7-FP8model· 61k dl· ♡ 12061k dl♡ 120
- 🤗unsloth/GLM-4.7-GGUFmodel· 30k dl· ♡ 21530k dl♡ 215
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Rough Sets and Fuzzy Logic · Cognitive Computing and Networks
