Linking convolutional kernel size to generalization bias in face analysis CNNs
Hao Liang, Josue Ortega Caro, Vikram Maheshri, Ankit B. Patel, Guha, Balakrishnan

TL;DR
This paper investigates how the choice of convolutional kernel size in CNNs causally influences face analysis bias across subpopulations, revealing that architectural hyperparameters can induce significant out-of-distribution biases.
Contribution
It introduces a causal framework linking CNN hyperparameters, specifically kernel size, to algorithmic bias, highlighting the impact of architectural choices on generalization across subpopulations.
Findings
Kernel size affects frequency content of learned features.
Modifying kernel size induces bias in face analysis performance.
Bias persists even with balanced datasets.
Abstract
Training dataset biases are by far the most scrutinized factors when explaining algorithmic biases of neural networks. In contrast, hyperparameters related to the neural network architecture have largely been ignored even though different network parameterizations are known to induce different implicit biases over learned features. For example, convolutional kernel size is known to affect the frequency content of features learned in CNNs. In this work, we present a causal framework for linking an architectural hyperparameter to out-of-distribution algorithmic bias. Our framework is experimental, in that we train several versions of a network with an intervention to a specific hyperparameter, and measure the resulting causal effect of this choice on performance bias when a particular out-of-distribution image perturbation is applied. In our experiments, we focused on measuring the causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Linking Convolutional Kernel Size to Generalization Bias in Face Analysis CNNs· youtube
Taxonomy
TopicsFace recognition and analysis · Domain Adaptation and Few-Shot Learning · AI in cancer detection
