RoChBert: Towards Robust BERT Fine-tuning for Chinese

Zihan Zhang; Jinfeng Li; Ning Shi; Bo Yuan; Xiangyu Liu; Rong Zhang,; Hui Xue; Donghong Sun; Chao Zhang

arXiv:2210.15944·cs.CL·October 31, 2022

RoChBert: Towards Robust BERT Fine-tuning for Chinese

Zihan Zhang, Jinfeng Li, Ning Shi, Bo Yuan, Xiangyu Liu, Rong Zhang,, Hui Xue, Donghong Sun, Chao Zhang

PDF

Open Access 1 Repo

TL;DR

RoChBERT enhances BERT's robustness for Chinese by integrating phonetic and glyph features through an adversarial graph, combined with curriculum-inspired data augmentation, leading to significant attack resistance without sacrificing accuracy.

Contribution

Introduces RoChBERT, a novel framework that fuses phonetic and glyph features via an adversarial graph and employs curriculum learning for robust Chinese BERT fine-tuning.

Findings

01

Significantly reduces attack success rates by over 39%.

02

Maintains high accuracy of 93.30% on benign texts.

03

Easily adaptable to various models and tasks.

Abstract

Despite of the superb performance on a wide range of tasks, pre-trained language models (e.g., BERT) have been proved vulnerable to adversarial texts. In this paper, we present RoChBERT, a framework to build more Robust BERT-based models by utilizing a more comprehensive adversarial graph to fuse Chinese phonetic and glyph features into pre-trained representations during fine-tuning. Inspired by curriculum learning, we further propose to augment the training dataset with adversarial texts in combination with intermediate samples. Extensive experiments demonstrate that RoChBERT outperforms previous methods in significant ways: (i) robust -- RoChBERT greatly improves the model robustness without sacrificing accuracy on benign texts. Specifically, the defense lowers the success rates of unlimited and limited attacks by 59.43% and 39.33% respectively, while remaining accuracy of 93.30%;…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zzh-z/rochbert
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Adversarial Robustness in Machine Learning · Topic Modeling