TeleChat Technical Report

Zhongjiang He; Zihan Wang; Xinzhang Liu; Shixuan Liu; Yitong Yao,; Yuyao Huang; Xuelong Li; Yongxiang Li; Zhonghao Che; Zhaoxi Zhang; Yan Wang,; Xin Wang; Luwen Pu; Huinan Xu; Ruiyu Fang; Yu Zhao; Jie Zhang; Xiaomeng; Huang; Zhilong Lu; Jiaxin Peng; Wenjun Zheng; Shiquan Wang; Bingkai Yang,; Xuewei he; Zhuoru Jiang; Qiyi Xie; Yanhan Zhang; Zhongqiu Li; Lingling Shi,; Weiwei Fu; Yin Zhang; Zilu Huang; Sishi Xiong; Yuxiang Zhang; Chao Wang,; Shuangyong Song

arXiv:2401.03804·cs.CL·April 3, 2024·2 cites

TeleChat Technical Report

Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao,, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang,, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng, Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang

PDF

Open Access 10 Models 1 Datasets

TL;DR

TeleChat introduces a series of large language models with 3B, 7B, and 12B parameters, trained on diverse multilingual data, fine-tuned for human preferences, and evaluated across multiple tasks, with models and resources released publicly.

Contribution

The paper presents a new set of multilingual LLMs with detailed training and fine-tuning methodology, and releases models and data for community use.

Findings

01

TeleChat models perform comparably to similar-sized open-source models.

02

Models are effective across language understanding, reasoning, and code generation tasks.

03

Public release of models and resources supports further research.

Abstract

In this technical report, we present TeleChat, a collection of large language models (LLMs) with parameters of 3 billion, 7 billion and 12 billion. It includes pretrained language models as well as fine-tuned chat models that is aligned with human preferences. TeleChat is initially pretrained on an extensive corpus containing a diverse collection of texts from both English and Chinese languages, including trillions of tokens. Subsequently, the model undergoes fine-tuning to align with human preferences, following a detailed methodology that we describe. We evaluate the performance of TeleChat on various tasks, including language understanding, mathematics, reasoning, code generation, and knowledge-based question answering. Our findings indicate that TeleChat achieves comparable performance to other open-source models of similar size across a wide range of public benchmarks. To support…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

Tele-AI/TeleChat-PTD
dataset· 575 dl
575 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsALIGN