InfoBERT: Improving Robustness of Language Models from An Information   Theoretic Perspective

Boxin Wang; Shuohang Wang; Yu Cheng; Zhe Gan; Ruoxi Jia; Bo Li,; Jingjing Liu

arXiv:2010.02329·cs.CL·March 23, 2021·61 cites

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective

Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li,, Jingjing Liu

PDF

Open Access 2 Repos 1 Video

TL;DR

InfoBERT introduces an information-theoretic framework with mutual-information regularizers to enhance the robustness of language models against adversarial attacks, achieving state-of-the-art results in NLP tasks.

Contribution

It proposes a novel mutual-information-based regularization framework for robust fine-tuning of language models, combining an information bottleneck and robust feature regularizers.

Findings

01

Achieves state-of-the-art adversarial robustness on NLI and QA datasets.

02

Effectively suppresses noisy information in representations.

03

Enhances mutual information between robust features and global features.

Abstract

Large-scale language models such as BERT have achieved state-of-the-art performance across a wide range of NLP tasks. Recent studies, however, show that such BERT-based models are vulnerable facing the threats of textual adversarial attacks. We aim to address this problem from an information-theoretic perspective, and propose InfoBERT, a novel learning framework for robust fine-tuning of pre-trained language models. InfoBERT contains two mutual-information-based regularizers for model training: (i) an Information Bottleneck regularizer, which suppresses noisy mutual information between the input and the feature representation; and (ii) a Robust Feature regularizer, which increases the mutual information between local robust features and global features. We provide a principled way to theoretically analyze and improve the robustness of representation learning for language models in both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective· slideslive

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Natural Language Processing Techniques

MethodsLinear Layer · Dense Connections · Layer Normalization · WordPiece · Multi-Head Attention · Dropout · Linear Warmup With Linear Decay · Attention Dropout · Weight Decay · Attention Is All You Need