PairCFR: Enhancing Model Training on Paired Counterfactually Augmented   Data through Contrastive Learning

Xiaoqi Qiu; Yongjie Wang; Xu Guo; Zhiwei Zeng; Yue Yu; Yuhong Feng,; Chunyan Miao

arXiv:2406.06633·cs.LG·June 12, 2024

PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning

Xiaoqi Qiu, Yongjie Wang, Xu Guo, Zhiwei Zeng, Yue Yu, Yuhong Feng,, Chunyan Miao

PDF

Open Access 1 Repo 1 Video

TL;DR

PairCFR introduces a contrastive learning approach to improve model training on counterfactually augmented data, enhancing robustness and reducing bias by promoting broader feature utilization.

Contribution

This work is the first to integrate contrastive learning with counterfactually augmented data to mitigate bias and improve out-of-distribution performance.

Findings

01

Outperforms state-of-the-art on OOD datasets

02

Encourages models to use a wider range of features

03

Theoretically proves contrastive loss promotes broader feature use

Abstract

Counterfactually Augmented Data (CAD) involves creating new data samples by applying minimal yet sufficient modifications to flip the label of existing data samples to other classes. Training with CAD enhances model robustness against spurious features that happen to correlate with labels by spreading the casual relationships across different classes. Yet, recent research reveals that training with CAD may lead models to overly focus on modified features while ignoring other important contextual information, inadvertently introducing biases that may impair performance on out-ofdistribution (OOD) datasets. To mitigate this issue, we employ contrastive learning to promote global feature alignment in addition to learning counterfactual clues. We theoretically prove that contrastive loss can encourage models to leverage a broader range of features beyond those modified ones. Comprehensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Siki-cloud/PairCFR
pytorchOfficial

Videos

PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning· underline

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis

MethodsFLIP · Contrastive Learning · Focus