Technical Report of TeleChat2, TeleChat2.5 and T1

Zihan Wang; Xinzhang Liu; Yitong Yao; Chao Wang; Yu Zhao; Zhihao Yang; Wenmin Deng; Kaipeng Jia; Jiaxin Peng; Yuyao Huang; Sishi Xiong; Zhuo Jiang; Kaidong Yu; Xiaohui Hu; Fubei Yao; Ruiyu Fang; Zhuoru Jiang; Ruiting Song; Qiyi Xie; Rui Xue; Xuewei He; Yanlei Xue; Zhu Yuan; Zhaoxi Zhang; Zilu Huang; Shiquan Wang; Xin Wang; Hanming Wu; Mingyuan Wang; Xufeng Zhan; Yuhan Sun; Zhaohu Xing; Yuhao Jiang; Bingkai Yang; Shuangyong Song; Yongxiang Li; Zhongjiang He; Xuelong Li

arXiv:2507.18013·cs.CL·July 30, 2025

Technical Report of TeleChat2, TeleChat2.5 and T1

Zihan Wang, Xinzhang Liu, Yitong Yao, Chao Wang, Yu Zhao, Zhihao Yang, Wenmin Deng, Kaipeng Jia, Jiaxin Peng, Yuyao Huang, Sishi Xiong, Zhuo Jiang, Kaidong Yu, Xiaohui Hu, Fubei Yao, Ruiyu Fang, Zhuoru Jiang, Ruiting Song, Qiyi Xie, Rui Xue, Xuewei He, Yanlei Xue, Zhu Yuan

PDF

Open Access 10 Models

TL;DR

This paper introduces TeleChat2, TeleChat2.5, and T1, a series of advanced language models with improved training strategies, larger datasets, and specialized capabilities for reasoning and speed, outperforming some proprietary models.

Contribution

The paper presents a new series of TeleChat models with enhanced training methods, domain-specific pretraining, and reinforcement learning, achieving superior performance in reasoning and coding tasks.

Findings

01

T1-115B outperforms GPT-4o and proprietary models.

02

TeleChat2.5 offers rapid inference for real-time applications.

03

Models are publicly released for research and development.

Abstract

We introduce the latest series of TeleChat models: \textbf{TeleChat2}, \textbf{TeleChat2.5}, and \textbf{T1}, offering a significant upgrade over their predecessor, TeleChat. Despite minimal changes to the model architecture, the new series achieves substantial performance gains through enhanced training strategies in both pre-training and post-training stages. The series begins with \textbf{TeleChat2}, which undergoes pretraining on 10 trillion high-quality and diverse tokens. This is followed by Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) to further enhance its capabilities. \textbf{TeleChat2.5} and \textbf{T1} expand the pipeline by incorporating a continual pretraining phase with domain-specific datasets, combined with reinforcement learning (RL) to improve performance in code generation and mathematical reasoning tasks. The \textbf{T1} variant is designed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)