Relation-based Counterfactual Data Augmentation and Contrastive Learning   for Robustifying Natural Language Inference Models

Heerin Yang; Sseung-won Hwang; Jungmin So

arXiv:2410.20710·cs.CL·October 29, 2024

Relation-based Counterfactual Data Augmentation and Contrastive Learning for Robustifying Natural Language Inference Models

Heerin Yang, Sseung-won Hwang, Jungmin So

PDF

Open Access

TL;DR

This paper introduces a novel approach combining relation-based counterfactual data augmentation with contrastive learning to enhance the robustness and performance of natural language inference models against non-causal biases.

Contribution

It proposes a new method that generates counterfactual sentence pairs and applies contrastive learning to improve NLI model robustness and understanding of semantic classes.

Findings

01

Improved accuracy on counterfactually-revised datasets

02

Enhanced model robustness to non-causal features

03

Better generalization on standard NLI datasets

Abstract

Although pre-trained language models show good performance on various natural language processing tasks, they often rely on non-causal features and patterns to determine the outcome. For natural language inference tasks, previous results have shown that even a model trained on a large number of data fails to perform well on counterfactually revised data, indicating that the model is not robustly learning the semantics of the classes. In this paper, we propose a method in which we use token-based and sentence-based augmentation methods to generate counterfactual sentence pairs that belong to each class, and apply contrastive learning to help the model learn the difference between sentence pairs of different classes with similar contexts. Evaluation results with counterfactually-revised dataset and general NLI datasets show that the proposed method can improve the performance and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsContrastive Learning