TCBERT: A Technical Report for Chinese Topic Classification BERT
Ting Han, Kunhao Pan, Xinyu Chen, Dingjie Song, Yuchen Fan, Xinyu Gao,, Ruyi Gan, Jiaxing Zhang

TL;DR
This paper introduces TCBERT, a Chinese BERT variant optimized for topic classification, achieved through supervised continued pre-training with prompt and contrastive learning on a large Chinese dataset.
Contribution
It presents a novel Chinese BERT model specifically pre-trained for topic classification, incorporating prompt and contrastive learning techniques.
Findings
Open-sourced TCBERT models with various sizes
Enhanced performance on Chinese topic classification tasks
Large-scale Chinese dataset for pre-training
Abstract
Bidirectional Encoder Representations from Transformers or BERT~\cite{devlin-etal-2019-bert} has been one of the base models for various NLP tasks due to its remarkable performance. Variants customized for different languages and tasks are proposed to further improve the performance. In this work, we investigate supervised continued pre-training~\cite{gururangan-etal-2020-dont} on BERT for Chinese topic classification task. Specifically, we incorporate prompt-based learning and contrastive learning into the pre-training. To adapt to the task of Chinese topic classification, we collect around 2.1M Chinese data spanning various topics. The pre-trained Chinese Topic Classification BERTs (TCBERTs) with different parameter sizes are open-sourced at \url{https://huggingface.co/IDEA-CCNL}.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗IDEA-CCNL/Erlangshen-TCBert-110M-Classification-Chinesemodel· 9 dl· ♡ 19 dl♡ 1
- 🤗IDEA-CCNL/Erlangshen-TCBert-110M-Sentence-Embedding-Chinesemodel· 15 dl· ♡ 515 dl♡ 5
- 🤗IDEA-CCNL/Erlangshen-TCBert-330M-Classification-Chinesemodel· 7 dl· ♡ 17 dl♡ 1
- 🤗IDEA-CCNL/Erlangshen-TCBert-1.3B-Classification-Chinesemodel· 2 dl· ♡ 12 dl♡ 1
- 🤗IDEA-CCNL/Erlangshen-TCBert-1.3B-Sentence-Embedding-Chinesemodel· 1 dl· ♡ 81 dl♡ 8
- 🤗IDEA-CCNL/Erlangshen-TCBert-330M-Sentence-Embedding-Chinesemodel· 4 dl· ♡ 94 dl♡ 9
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Weight Decay · Adam · Linear Layer · Dense Connections · Residual Connection · Attention Dropout · Balanced Selection · Attention Is All You Need
