Model Patching: Closing the Subgroup Performance Gap with Data Augmentation
Karan Goel, Albert Gu, Yixuan Li, Christopher R\'e

TL;DR
This paper introduces model patching, a two-stage data augmentation framework that improves classifier robustness by reducing subgroup performance disparities, demonstrated on multiple datasets including skin cancer classification.
Contribution
The paper proposes a novel model patching framework using CycleGAN-based augmentations and a subgroup regularizer to enhance model invariance and robustness.
Findings
Up to 33% reduction in robust error on benchmark datasets.
Effective patching of a skin cancer classifier with spurious features.
Demonstrated improvement in subgroup performance consistency.
Abstract
Classifiers in machine learning are often brittle when deployed. Particularly concerning are models with inconsistent performance on specific subgroups of a class, e.g., exhibiting disparities in skin cancer classification in the presence or absence of a spurious bandage. To mitigate these performance differences, we introduce model patching, a two-stage framework for improving robustness that encourages the model to be invariant to subgroup differences, and focus on class information shared by subgroups. Model patching first models subgroup features within a class and learns semantic transformations between them, and then trains a classifier with data augmentations that deliberately manipulate subgroup features. We instantiate model patching with CAMEL, which (1) uses a CycleGAN to learn the intra-class, inter-subgroup augmentations, and (2) balances subgroup performance using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Software Testing and Debugging Techniques · Software System Performance and Reliability
MethodsConvolution · PatchGAN · GAN Least Squares Loss · Tanh Activation · Cycle Consistency Loss · *Communicated@Fast*How Do I Communicate to Expedia? · Instance Normalization · HuMan(Expedia)||How do I get a human at Expedia? · Batch Normalization · Residual Connection
