# CoBA: Counterbias Text Augmentation for Mitigating Various Spurious Correlations via Semantic Triples

**Authors:** Kyohoon Jin, Juhwan Choi, Jungmin Yun, Junho Lee, Soojin Jang, Youngbin Kim

arXiv: 2508.21083 · 2025-11-21

## TL;DR

CoBA introduces a semantic triple-based data augmentation method that mitigates spurious correlations in text models, improving robustness and reducing biases across various tasks.

## Contribution

It proposes a novel counterbias data augmentation framework operating at the semantic triple level to address multiple biases simultaneously.

## Key findings

- Enhanced model robustness to out-of-distribution data
- Significant bias reduction in downstream tasks
- Improved generalization performance

## Abstract

Deep learning models often learn and exploit spurious correlations in training data, using these non-target features to inform their predictions. Such reliance leads to performance degradation and poor generalization on unseen data. To address these limitations, we introduce a more general form of counterfactual data augmentation, termed counterbias data augmentation, which simultaneously tackles multiple biases (e.g., gender bias, simplicity bias) and enhances out-of-distribution robustness. We present CoBA: CounterBias Augmentation, a unified framework that operates at the semantic triple level: first decomposing text into subject-predicate-object triples, then selectively modifying these triples to disrupt spurious correlations. By reconstructing the text from these adjusted triples, CoBA generates counterbias data that mitigates spurious patterns. Through extensive experiments, we demonstrate that CoBA not only improves downstream task performance, but also effectively reduces biases and strengthens out-of-distribution resilience, offering a versatile and robust solution to the challenges posed by spurious correlations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.21083/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/2508.21083/full.md

## References

78 references — full list in the complete paper: https://tomesphere.com/paper/2508.21083/full.md

---
Source: https://tomesphere.com/paper/2508.21083