Generating Authentic Adversarial Examples beyond Meaning-preserving with   Doubly Round-trip Translation

Siyu Lai; Zhen Yang; Fandong Meng; Xue Zhang; Yufeng Chen; Jinan Xu; and Jie Zhou

arXiv:2204.08689·cs.CL·April 20, 2022

Generating Authentic Adversarial Examples beyond Meaning-preserving with Doubly Round-trip Translation

Siyu Lai, Zhen Yang, Fandong Meng, Xue Zhang, Yufeng Chen, Jinan Xu, and Jie Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces Doubly Round-trip Translation (DRTT) to generate authentic adversarial examples for NMT, improving model robustness by better identifying adversarial samples and using them for training.

Contribution

It proposes DRTT as a new criterion for generating adversarial examples, addressing ambiguity in single RTT methods, and leverages bilingual adversarial pairs to enhance NMT robustness.

Findings

01

Significant robustness improvements on clean and noisy test sets.

02

Effective identification of authentic adversarial examples.

03

Enhanced NMT performance with DRTT-based training.

Abstract

Generating adversarial examples for Neural Machine Translation (NMT) with single Round-Trip Translation (RTT) has achieved promising results by releasing the meaning-preserving restriction. However, a potential pitfall for this approach is that we cannot decide whether the generated examples are adversarial to the target NMT model or the auxiliary backward one, as the reconstruction error through the RTT can be related to either. To remedy this problem, we propose a new criterion for NMT adversarial examples based on the Doubly Round-Trip Translation (DRTT). Specifically, apart from the source-target-source RTT, we also consider the target-source-target one, which is utilized to pick out the authentic adversarial examples for the target NMT model. Additionally, to enhance the robustness of the NMT model, we introduce the masked language models to construct bilingual adversarial pairs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lisasiyu/drtt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Adversarial Robustness in Machine Learning