Precisely the Point: Adversarial Augmentations for Faithful and   Informative Text Generation

Wenhao Wu; Wei Li; Jiachen Liu; Xinyan Xiao; Sujian Li; Yajuan Lyu

arXiv:2210.12367·cs.CL·October 25, 2022

Precisely the Point: Adversarial Augmentations for Faithful and Informative Text Generation

Wenhao Wu, Wei Li, Jiachen Liu, Xinyan Xiao, Sujian Li, Yajuan Lyu

PDF

Open Access

TL;DR

This paper analyzes the robustness of pre-trained Seq2Seq models like BART, finds vulnerabilities affecting faithfulness and informativeness, and introduces AdvSeq, an adversarial augmentation framework that significantly enhances these qualities.

Contribution

It provides the first quantitative analysis of Seq2Seq robustness and proposes a novel adversarial augmentation method, AdvSeq, to improve faithfulness and informativeness.

Findings

01

AdvSeq improves faithfulness in text generation.

02

AdvSeq enhances informativeness of Seq2Seq models.

03

Experimental results show significant gains in robustness.

Abstract

Though model robustness has been extensively studied in language understanding, the robustness of Seq2Seq generation remains understudied. In this paper, we conduct the first quantitative analysis on the robustness of pre-trained Seq2Seq models. We find that even current SOTA pre-trained Seq2Seq model (BART) is still vulnerable, which leads to significant degeneration in faithfulness and informativeness for text generation tasks. This motivated us to further propose a novel adversarial augmentation framework, namely AdvSeq, for generally improving faithfulness and informativeness of Seq2Seq models via enhancing their robustness. AdvSeq automatically constructs two types of adversarial augmentations during training, including implicit adversarial samples by perturbing word representations and explicit adversarial samples by word swapping, both of which effectively improve Seq2Seq…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Adversarial Robustness in Machine Learning

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence