Adjusting Pretrained Backbones for Performativity
Berker Demirel, Lingjing Kong, Kun Zhang, Theofanis Karaletsos,, Celestine Mendler-D\"unner, Francesco Locatello

TL;DR
This paper introduces a modular method to adapt pretrained deep learning models for performativity, improving efficiency and reusability in environments with distribution shifts, especially under label shift conditions.
Contribution
It proposes a shallow adapter module for pretrained backbones to correct performative label shift without altering input dimensions, enabling better sample efficiency and model reuse.
Findings
Reduces loss during retraining in adversarial sampling scenarios
Enables effective model selection to anticipate performance degradation
Provides a baseline for addressing performativity in deep learning
Abstract
With the widespread deployment of deep learning models, they influence their environment in various ways. The induced distribution shifts can lead to unexpected performance degradation in deployed models. Existing methods to anticipate performativity typically incorporate information about the deployed model into the feature vector when predicting future outcomes. While enjoying appealing theoretical properties, modifying the input dimension of the prediction task is often not practical. To address this, we propose a novel technique to adjust pretrained backbones for performativity in a modular way, achieving better sample efficiency and enabling the reuse of existing deep learning assets. Focusing on performative label shift, the key idea is to train a shallow adapter module to perform a Bayes-optimal label shift correction to the backbone's logits given a sufficient statistic of the…
Peer Reviews
Decision·Submitted to ICLR 2025
Originality: The paper introduces two novel contributions: a practical baseline for handling performative label shift and a module to predict model performativity. These aspects offer a fresh approach in this field. Quality: The paper is well-executed, with extensive experiments across several datasets. Baselines are clearly presented, and the rationale behind the choices is well-explained, making the contribution and results robust and grounded. Significance: Although I am not familiar with t
I must admit I'm having difficulty understanding certain aspects of this article. Firstly, regarding the evaluation metric in Lines 362-363: what exactly is the 'retraining trajectory'? Is this term defined within this article, or does it originate from previous work? Additionally, how is this definition of 'Acc' distinct from 'Acc in S'? This difference significantly impacts my interpretation of the results in Figures 2 and 3. Similarly, in Algorithm 1, does 'Deploy' imply training on the samp
This paper addressed an important problem of performativity for deployed deep learning models given post deployment distribution shifts, and proposed a plug-and-play adapter module to be used with pre-trained backbones. The authors have formulated this interesting problem and conducted various experiments demonstrating the effectiveness of their method.
1. It's a bit hard to understand the proposed method given the writing, which needs to be improved particularly at places critical for elaborating the core module and results. For example, reading through section 1.1 and section 3, it's unclear what "sufficient statistic" refers to, which is an important inputs to the proposed adapter module. It's not until in later sections the authors gave examples for statistic such as class accuracies. Another example, reading through p. 7 lines 324-328, I s
The paper idea is simple and reasonable. The overall framework is efficient and easy to deploy. Compared to without adapdation, the model significantly improved the performance.
1. Some of the definitions in the paper is clear, which increased the difficulty to understand the paper. 2 The benchmark dataset setting is not clearly stated.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimedia Communication and Technology
MethodsAdapter
