Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence

Jiaxing Zhang; Ruyi Gan; Junjie Wang; Yuxiang Zhang; Lin Zhang; Ping; Yang; Xinyu Gao; Ziwei Wu; Xiaoqun Dong; Junqing He; Jianheng Zhuo; Qi Yang,; Yongfeng Huang; Xiayu Li; Yanghan Wu; Junyu Lu; Xinyu Zhu; Weifeng Chen; Ting; Han; Kunhao Pan; Rui Wang; Hao Wang; Xiaojun Wu; Zhongshen Zeng; Chongpei; Chen

arXiv:2209.02970·cs.CL·March 31, 2023·45 cites

Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence

Jiaxing Zhang, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Lin Zhang, Ping, Yang, Xinyu Gao, Ziwei Wu, Xiaoqun Dong, Junqing He, Jianheng Zhuo, Qi Yang,, Yongfeng Huang, Xiayu Li, Yanghan Wu, Junyu Lu, Xinyu Zhu, Weifeng Chen, Ting, Han, Kunhao Pan, Rui Wang, Hao Wang, Xiaojun Wu

PDF

Open Access 1 Repo 10 Models 3 Datasets

TL;DR

Fengshenbang 1.0 is an open-source project that provides comprehensive Chinese-language foundation models, tools, and benchmarks to support the development of Chinese cognitive intelligence and democratize access to large-scale models.

Contribution

It introduces a complete ecosystem for Chinese foundation models, including models, frameworks, benchmarks, and datasets, fostering community collaboration and resource sharing.

Findings

01

Launch of large pre-trained Chinese models

02

Development of user-friendly APIs and benchmarks

03

Promotion of open-source ecosystem for Chinese AI models

Abstract

Nowadays, foundation models become one of fundamental infrastructures in artificial intelligence, paving ways to the general intelligence. However, the reality presents two urgent challenges: existing foundation models are dominated by the English-language community; users are often given limited resources and thus cannot always use foundation models. To support the development of the Chinese-language community, we introduce an open-source project, called Fengshenbang, which leads by the research center for Cognitive Computing and Natural Language (CCNL). Our project has comprehensive capabilities, including large pre-trained models, user-friendly APIs, benchmarks, datasets, and others. We wrap all these in three sub-projects: the Fengshenbang Model, the Fengshen Framework, and the Fengshen Benchmark. An open-source roadmap, Fengshenbang, aims to re-evaluate the open-source community of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

idea-ccnl/fengshenbang-lm
pytorchOfficial

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Topic Modeling