TeleChat Technical Report
Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao,, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang,, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng, Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang

TL;DR
TeleChat introduces a series of large language models with 3B, 7B, and 12B parameters, trained on diverse multilingual data, fine-tuned for human preferences, and evaluated across multiple tasks, with models and resources released publicly.
Contribution
The paper presents a new set of multilingual LLMs with detailed training and fine-tuning methodology, and releases models and data for community use.
Findings
TeleChat models perform comparably to similar-sized open-source models.
Models are effective across language understanding, reasoning, and code generation tasks.
Public release of models and resources supports further research.
Abstract
In this technical report, we present TeleChat, a collection of large language models (LLMs) with parameters of 3 billion, 7 billion and 12 billion. It includes pretrained language models as well as fine-tuned chat models that is aligned with human preferences. TeleChat is initially pretrained on an extensive corpus containing a diverse collection of texts from both English and Chinese languages, including trillions of tokens. Subsequently, the model undergoes fine-tuning to align with human preferences, following a detailed methodology that we describe. We evaluate the performance of TeleChat on various tasks, including language understanding, mathematics, reasoning, code generation, and knowledge-based question answering. Our findings indicate that TeleChat achieves comparable performance to other open-source models of similar size across a wide range of public benchmarks. To support…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Tele-AI/telechat-7Bmodel· 206 dl· ♡ 108206 dl♡ 108
- 🤗Tele-AI/telechat-7B-int8model· 16 dl· ♡ 8116 dl♡ 81
- 🤗Tele-AI/telechat-7B-int4model· 13 dl· ♡ 7613 dl♡ 76
- 🤗Tele-AI/TeleChat-12B-int8model· 9 dl· ♡ 29 dl♡ 2
- 🤗Tele-AI/TeleChat-12Bmodel· 152 dl· ♡ 11152 dl♡ 11
- 🤗Tele-AI/TeleChat-12B-int4model· 7 dl· ♡ 27 dl♡ 2
- 🤗Tele-AI/TeleChat-12B-v2model· 15 dl· ♡ 115 dl♡ 1
- 🤗Tele-AI/TeleChat-52Bmodel· 20 dl· ♡ 220 dl♡ 2
- 🤗Tele-AI/TeleChat-1Bmodel· 16 dl· ♡ 116 dl♡ 1
- 🤗chuhac/TeleChat2-115Bmodel· 1 dl· ♡ 31 dl♡ 3
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsALIGN
