RoChBert: Towards Robust BERT Fine-tuning for Chinese
Zihan Zhang, Jinfeng Li, Ning Shi, Bo Yuan, Xiangyu Liu, Rong Zhang,, Hui Xue, Donghong Sun, Chao Zhang

TL;DR
RoChBERT enhances BERT's robustness for Chinese by integrating phonetic and glyph features through an adversarial graph, combined with curriculum-inspired data augmentation, leading to significant attack resistance without sacrificing accuracy.
Contribution
Introduces RoChBERT, a novel framework that fuses phonetic and glyph features via an adversarial graph and employs curriculum learning for robust Chinese BERT fine-tuning.
Findings
Significantly reduces attack success rates by over 39%.
Maintains high accuracy of 93.30% on benign texts.
Easily adaptable to various models and tasks.
Abstract
Despite of the superb performance on a wide range of tasks, pre-trained language models (e.g., BERT) have been proved vulnerable to adversarial texts. In this paper, we present RoChBERT, a framework to build more Robust BERT-based models by utilizing a more comprehensive adversarial graph to fuse Chinese phonetic and glyph features into pre-trained representations during fine-tuning. Inspired by curriculum learning, we further propose to augment the training dataset with adversarial texts in combination with intermediate samples. Extensive experiments demonstrate that RoChBERT outperforms previous methods in significant ways: (i) robust -- RoChBERT greatly improves the model robustness without sacrificing accuracy on benign texts. Specifically, the defense lowers the success rates of unlimited and limited attacks by 59.43% and 39.33% respectively, while remaining accuracy of 93.30%;…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Adversarial Robustness in Machine Learning · Topic Modeling
