Shortcut Invariance: Targeted Jacobian Regularization in Disentangled Latent Space

Shivam Pal; Sakshi Varshney; Piyush Rai

arXiv:2511.19525·cs.LG·March 10, 2026

Shortcut Invariance: Targeted Jacobian Regularization in Disentangled Latent Space

Shivam Pal, Sakshi Varshney, Piyush Rai

PDF

Open Access

TL;DR

This paper introduces a novel latent-space regularization technique that reduces neural network reliance on shortcut features by injecting targeted anisotropic noise, improving out-of-distribution generalization without needing shortcut labels.

Contribution

The authors propose a Jacobian regularization method in disentangled latent space that preserves full representations while reducing shortcut reliance, enhancing OOD robustness.

Findings

01

Achieves state-of-the-art OOD performance on shortcut-learning benchmarks.

02

Does not require shortcut labels or conflicting samples for training.

03

Effectively flattens decision boundaries along shortcut axes.

Abstract

Deep neural networks are prone to learning shortcuts, spurious correlations present in the training data that undermine out-of-distribution (OOD) generalization. Most prior work mitigates shortcut learning through input-space reweighting, either relying on explicit shortcut labels or inferring shortcut structure from heuristics such as per-sample loss. Moreover, these approaches typically assume the presence of some shortcut-conflicting examples in the training set, an assumption that is often violated in practice, particularly in medical imaging where data is aggregated across institutions with different acquisition protocols. We propose a latent-space method that views shortcut learning as over-reliance on shortcut-aligned axes. In a disentangled latent space, we identify candidate shortcut-aligned axes via their strong correlation with labels and reduce classifier reliance on them…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis