Reinforcing Pre-trained Models Using Counterfactual Images

Xiang Li; Ren Togo; Keisuke Maeda; Takahiro Ogawa; Miki Haseyama

arXiv:2406.13316·cs.CV·June 21, 2024

Reinforcing Pre-trained Models Using Counterfactual Images

Xiang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama

PDF

Open Access

TL;DR

This paper introduces a framework that uses language-guided generated counterfactual images to identify and reinforce classification models, improving their robustness by fine-tuning with these counterfactuals.

Contribution

It presents a novel two-stage process for identifying model weaknesses and reinforcing models using counterfactual images generated from perturbed captions.

Findings

01

Fine-tuning with counterfactual images improves model robustness.

02

The framework effectively identifies vulnerabilities in classification models.

03

Experiments show significant performance gains across datasets.

Abstract

This paper proposes a novel framework to reinforce classification models using language-guided generated counterfactual images. Deep learning classification models are often trained using datasets that mirror real-world scenarios. In this training process, because learning is based solely on correlations with labels, there is a risk that models may learn spurious relationships, such as an overreliance on features not central to the subject, like background elements in images. However, due to the black-box nature of the decision-making process in deep learning models, identifying and addressing these vulnerabilities has been particularly challenging. We introduce a novel framework for reinforcing the classification models, which consists of a two-stage process. First, we identify model weaknesses by testing the model using the counterfactual image dataset, which is generated by perturbed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis

MethodsSparse Evolutionary Training