eXplaining to Learn (eX2L): Regularization Using Contrastive Visual Explanation Pairs for Distribution Shifts

Paulo Mario P. Medina; Jose Marie Antonio Mi\~noza; Sebastian C. Iba\~nez

arXiv:2605.06368·cs.CV·May 8, 2026

eXplaining to Learn (eX2L): Regularization Using Contrastive Visual Explanation Pairs for Distribution Shifts

Paulo Mario P. Medina, Jose Marie Antonio Mi\~noza, Sebastian C. Iba\~nez

PDF

TL;DR

eX2L introduces an interpretable regularization method that uses contrastive visual explanation pairs to improve robustness against distribution shifts by decorrelating confounding features.

Contribution

It presents a novel explanation-based framework that explicitly decouples label and nuisance attributes during training to enhance distributional robustness.

Findings

01

eX2L achieves 82.24% average accuracy on the Spawrious benchmark.

02

It outperforms state-of-the-art methods by 5.49% in average accuracy.

03

eX2L improves worst-group accuracy by 10.90%.

Abstract

Despite extensive research into mitigating distribution shifts, many existing algorithms yield inconsistent performance, often failing to outperform baseline Empirical Risk Minimization (ERM) across diverse scenarios. Furthermore, high algorithmic complexity frequently limits interpretability and offers only an indirect means of addressing spurious correlations. We propose eXplaining to Learn (eX2L): an interpretable, explanation-based framework that decorrelates confounding features from a classifier's latent representations during training. eX2L achieves this by penalizing the similarity between Grad-CAM activation maps generated by a primary label classifier and those from a concurrently trained confounder classifier. On the rigorous Spawrious Many-to-Many Hard Challenge benchmark, eX2L achieves an average accuracy (AA) of 82.24% +/- 3.87% and a worst-group accuracy (WGA) of 66.31%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.