An Empirical Study of Instruction-tuning Large Language Models in   Chinese

Qingyi Si; Tong Wang; Zheng Lin; Xu Zhang; Yanan Cao; Weiping Wang

arXiv:2310.07328·cs.CL·October 23, 2023·1 cites

An Empirical Study of Instruction-tuning Large Language Models in Chinese

Qingyi Si, Tong Wang, Zheng Lin, Xu Zhang, Yanan Cao, Weiping Wang

PDF

Open Access 1 Repo

TL;DR

This paper provides an empirical analysis of instruction-tuning large language models in Chinese, exploring key factors affecting performance and aiming to enhance Chinese LLMs like ChatGPT.

Contribution

It systematically investigates the effects of base models, parameter-efficient methods, and data types on instruction-tuning Chinese LLMs, offering practical insights.

Findings

01

Instruction data type significantly impacts tuning effectiveness.

02

Parameter-efficient methods can achieve competitive performance.

03

Additional factors like chain-of-thought data influence results.

Abstract

The success of ChatGPT validates the potential of large language models (LLMs) in artificial general intelligence (AGI). Subsequently, the release of LLMs has sparked the open-source community's interest in instruction-tuning, which is deemed to accelerate ChatGPT's replication process. However, research on instruction-tuning LLMs in Chinese, the world's most spoken language, is still in its early stages. Therefore, this paper makes an in-depth empirical study of instruction-tuning LLMs in Chinese, which can serve as a cookbook that provides valuable findings for effectively customizing LLMs that can better respond to Chinese instructions. Specifically, we systematically explore the impact of LLM bases, parameter-efficient methods, instruction data types, which are the three most important elements for instruction-tuning. Besides, we also conduct experiment to study the impact of other…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

phoebussi/alpaca-cot
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques