Is Last Layer Re-Training Truly Sufficient for Robustness to Spurious   Correlations?

Phuong Quynh Le; J\"org Schl\"otterer; Christin Seifert

arXiv:2308.00473·cs.LG·January 11, 2024·1 cites

Is Last Layer Re-Training Truly Sufficient for Robustness to Spurious Correlations?

Phuong Quynh Le, J\"org Schl\"otterer, Christin Seifert

PDF

Open Access

TL;DR

This paper investigates the effectiveness of last-layer retraining methods like DFR in improving model robustness against spurious correlations, especially in medical data, and finds that such methods have limitations.

Contribution

The study evaluates DFR's applicability to medical data and analyzes why last-layer retraining may not fully eliminate reliance on spurious features.

Findings

01

DFR improves worst-group accuracy but remains susceptible to spurious correlations.

02

Last-layer retraining alone may be insufficient for robustness in realistic data.

03

Analysis explains the limitations of DFR in addressing spurious features.

Abstract

Models trained with empirical risk minimization (ERM) are known to learn to rely on spurious features, i.e., their prediction is based on undesired auxiliary features which are strongly correlated with class labels but lack causal reasoning. This behavior particularly degrades accuracy in groups of samples of the correlated class that are missing the spurious feature or samples of the opposite class but with the spurious feature present. The recently proposed Deep Feature Reweighting (DFR) method improves accuracy of these worst groups. Based on the main argument that ERM mods can learn core features sufficiently well, DFR only needs to retrain the last layer of the classification model with a small group-balanced data set. In this work, we examine the applicability of DFR to realistic data in the medical domain. Furthermore, we investigate the reasoning behind the effectiveness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare · Machine Learning and Data Classification