Adjusting Pretrained Backbones for Performativity

Berker Demirel; Lingjing Kong; Kun Zhang; Theofanis Karaletsos,; Celestine Mendler-D\"unner; Francesco Locatello

arXiv:2410.04499·cs.LG·October 8, 2024

Adjusting Pretrained Backbones for Performativity

Berker Demirel, Lingjing Kong, Kun Zhang, Theofanis Karaletsos,, Celestine Mendler-D\"unner, Francesco Locatello

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces a modular method to adapt pretrained deep learning models for performativity, improving efficiency and reusability in environments with distribution shifts, especially under label shift conditions.

Contribution

It proposes a shallow adapter module for pretrained backbones to correct performative label shift without altering input dimensions, enabling better sample efficiency and model reuse.

Findings

01

Reduces loss during retraining in adversarial sampling scenarios

02

Enables effective model selection to anticipate performance degradation

03

Provides a baseline for addressing performativity in deep learning

Abstract

With the widespread deployment of deep learning models, they influence their environment in various ways. The induced distribution shifts can lead to unexpected performance degradation in deployed models. Existing methods to anticipate performativity typically incorporate information about the deployed model into the feature vector when predicting future outcomes. While enjoying appealing theoretical properties, modifying the input dimension of the prediction task is often not practical. To address this, we propose a novel technique to adjust pretrained backbones for performativity in a modular way, achieving better sample efficiency and enabling the reuse of existing deep learning assets. Focusing on performative label shift, the key idea is to train a shallow adapter module to perform a Bayes-optimal label shift correction to the backbone's logits given a sufficient statistic of the…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 6Confidence 2

Strengths

Originality: The paper introduces two novel contributions: a practical baseline for handling performative label shift and a module to predict model performativity. These aspects offer a fresh approach in this field. Quality: The paper is well-executed, with extensive experiments across several datasets. Baselines are clearly presented, and the rationale behind the choices is well-explained, making the contribution and results robust and grounded. Significance: Although I am not familiar with t

Weaknesses

I must admit I'm having difficulty understanding certain aspects of this article. Firstly, regarding the evaluation metric in Lines 362-363: what exactly is the 'retraining trajectory'? Is this term defined within this article, or does it originate from previous work? Additionally, how is this definition of 'Acc' distinct from 'Acc in S'? This difference significantly impacts my interpretation of the results in Figures 2 and 3. Similarly, in Algorithm 1, does 'Deploy' imply training on the samp

Reviewer 02Rating 6Confidence 3

Strengths

This paper addressed an important problem of performativity for deployed deep learning models given post deployment distribution shifts, and proposed a plug-and-play adapter module to be used with pre-trained backbones. The authors have formulated this interesting problem and conducted various experiments demonstrating the effectiveness of their method.

Weaknesses

1. It's a bit hard to understand the proposed method given the writing, which needs to be improved particularly at places critical for elaborating the core module and results. For example, reading through section 1.1 and section 3, it's unclear what "sufficient statistic" refers to, which is an important inputs to the proposed adapter module. It's not until in later sections the authors gave examples for statistic such as class accuracies. Another example, reading through p. 7 lines 324-328, I s

Reviewer 03Rating 3Confidence 4

Strengths

The paper idea is simple and reasonable. The overall framework is efficient and easy to deploy. Compared to without adapdation, the model significantly improved the performance.

Weaknesses

1. Some of the definitions in the paper is clear, which increased the difficulty to understand the paper. 2 The benchmark dataset setting is not clearly stated.

Code & Models

Repositories

berkerdemirel/Adjusting-Pretrained-Backbones-for-Performativity
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimedia Communication and Technology

MethodsAdapter