On the Foundations of Shortcut Learning
Katherine L. Hermann, Hossein Mobahi, Thomas Fel, Michael C. Mozer

TL;DR
This paper investigates how the availability and predictivity of features influence shortcut learning in deep models, revealing that nonlinear architectures tend to favor more available, less predictive features, which impacts their decision-making.
Contribution
It introduces a minimal generative framework to analyze feature bias and demonstrates how nonlinear models exhibit shortcut bias influenced by feature availability.
Findings
Linear models are relatively unbiased.
Nonlinear models with ReLU or Tanh units show shortcut bias.
Availability manipulations can increase shortcut bias in natural datasets.
Abstract
Deep-learning models can extract a rich assortment of features from data. Which features a model uses depends not only on \emph{predictivity} -- how reliably a feature indicates training-set labels -- but also on \emph{availability} -- how easily the feature can be extracted from inputs. The literature on shortcut learning has noted examples in which models privilege one feature over another, for example texture over shape and image backgrounds over foreground objects. Here, we test hypotheses about which input properties are more available to a model, and systematically study how predictivity and availability interact to shape models' feature use. We construct a minimal, explicit generative framework for synthesizing classification datasets with two latent features that vary in predictivity and in factors we hypothesize to relate to availability, and we quantify a model's shortcut bias…
Peer Reviews
Decision·ICLR 2024 spotlight
The paper is well-written and the main message that availability is a key driver to shortcut learning is convincingly conveyed. Both empirical support and theoretical support are provided to underline the effect of availability on shortcut learning.
The theoretical analysis is limited to a single hidden layer MLP which does not reflect the type of architectures used in practice. It is difficult to conclude whether this also holds for other types of architectures like Transformers or CNNs. Experiments are limited to supervised classification settings. Self-supervised training or other tasks like object detection would be interesting to consider in the context of shortcut learning.
- S1. The paper deals with an important aspect of understanding neural networks, the bias for learning shortcuts. - S2. Using the notion of shortcut bias, based on the reliance of an optimal predictor is a good idea. - S3. Analysing based on a notion of availability, that can be computed produces some good observations. - S4. Interesting to see that theoretically, ReLU networks are more biased than linear networks.
- W1. The notion of availability is very closely connected with the concept of simplicity of neural networks. The simplicity bias has been pointed out as a cause of non-robust learning. The paper gives multiple references (e.g. Shah et al., 2020, etc.) for works dealing with simplicity bias, but they are not discussed in detail. The authors must clarify what is the difference between the notion of availability presented in this work, and the simplicity bias previously introduced. What new observ
* The paper is well-written and easy to follow. The problem of shortcut learning, which is not extensively studied, is important to understand. * The paper presents interesting insights into neural networks’ preference toward shortcuts. The paper studies shortcut learning empirically using controlled datasets and theoretically using the NTK. * The derivation for the bias of linear and ReLU networks using NTK would be helpful for future work in this domain.
* The main observation that the model depth and non-linearity increase bias towards the shortcut features is intuitive. Both depth and non-linearity allows the model to learn a rich representation. * Theoretical analysis using NTK is problematic as they don't learn feature and thus cannot necessarily explain model's preference towards the shortcut feature. * Only vision tasks are explored in the paper, it is not clear if the observations will hold true for other domains. * It would be intere
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Neural Networks and Applications
