Pay Attention to the Robustness of Chinese Minority Language Models!   Syllable-level Textual Adversarial Attack on Tibetan Script

Xi Cao; Dolma Dawa; Nuo Qun; Trashi Nyima

arXiv:2412.02323·cs.CL·December 5, 2024

Pay Attention to the Robustness of Chinese Minority Language Models! Syllable-level Textual Adversarial Attack on Tibetan Script

Xi Cao, Dolma Dawa, Nuo Qun, Trashi Nyima

PDF

1 Repo

TL;DR

This paper introduces TSAttacker, a syllable-level black-box adversarial attack method for Tibetan language models, revealing their vulnerability and highlighting the need for improved robustness in Chinese minority language NLP models.

Contribution

It proposes a novel syllable-level adversarial attack method for Tibetan, addressing a gap in research on Chinese minority languages and evaluating model robustness.

Findings

01

TSAttacker effectively generates high-quality adversarial samples.

02

Models show significant vulnerability to the attack.

03

Robustness of Tibetan language models needs improvement.

Abstract

The textual adversarial attack refers to an attack method in which the attacker adds imperceptible perturbations to the original texts by elaborate design so that the NLP (natural language processing) model produces false judgments. This method is also used to evaluate the robustness of NLP models. Currently, most of the research in this field focuses on English, and there is also a certain amount of research on Chinese. However, to the best of our knowledge, there is little research targeting Chinese minority languages. Textual adversarial attacks are a new challenge for the information processing of Chinese minority languages. In response to this situation, we propose a Tibetan syllable-level black-box textual adversarial attack called TSAttacker based on syllable cosine distance and scoring mechanism. And then, we conduct TSAttacker on six models generated by fine-tuning two PLMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

utibetnlp/tsattacker
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.