Sibylvariant Transformations for Robust Text Classification

Fabrice Harel-Canada; Muhammad Ali Gulzar; Nanyun Peng; Miryung Kim

arXiv:2205.05137·cs.CL·May 12, 2022

Sibylvariant Transformations for Robust Text Classification

Fabrice Harel-Canada, Muhammad Ali Gulzar, Nanyun Peng, Miryung Kim

PDF

Open Access 1 Repo

TL;DR

This paper introduces sibylvariance, a new class of NLP data transformations that relax label-preserving constraints, creating more diverse inputs to improve model robustness and generalization.

Contribution

The paper proposes a unified framework for sibylvariance transformations, including novel techniques like Concept2Sentence and SentMix, enhancing NLP model robustness.

Findings

01

Improved generalization performance across six datasets

02

Enhanced defect detection capabilities

03

Increased adversarial robustness

Abstract

The vast majority of text transformation techniques in NLP are inherently limited in their ability to expand input space coverage due to an implicit constraint to preserve the original class label. In this work, we propose the notion of sibylvariance (SIB) to describe the broader set of transforms that relax the label-preserving constraint, knowably vary the expected class, and lead to significantly more diverse input distributions. We offer a unified framework to organize all data transformations, including two types of SIB: (1) Transmutations convert one discrete kind into another, (2) Mixture Mutations blend two or more classes together. To explore the role of sibylvariance within NLP, we implemented 41 text transformations, including several novel techniques like Concept2Sentence and SentMix. Sibylvariance also enables a unique form of adaptive training that generates new input…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ucla-seal/sibyl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques