Zero-Shot Robustification of Zero-Shot Models
Dyah Adila, Changho Shin, Linrong Cai, Frederic Sala

TL;DR
RoboShot enhances the robustness of zero-shot pretrained models by removing biases from embeddings using language models, without additional training, leading to significant improvements in worst-group accuracy across multiple tasks.
Contribution
The paper introduces RoboShot, a novel zero-shot method that improves model robustness by bias mitigation in embeddings using language models, without requiring supervision or fine-tuning.
Findings
Average 15.98% improvement in worst group accuracy
Compatible with various pretrained and language models
Maintains overall accuracy while boosting robustness
Abstract
Zero-shot inference is a powerful paradigm that enables the use of large pretrained models for downstream classification tasks without further training. However, these models are vulnerable to inherited biases that can impact their performance. The traditional solution is fine-tuning, but this undermines the key advantage of pretrained models, which is their ability to be used out-of-the-box. We propose RoboShot, a method that improves the robustness of pretrained model embeddings in a fully zero-shot fashion. First, we use language models (LMs) to obtain useful insights from task descriptions. These insights are embedded and used to remove harmful and boost useful components in embeddings -- without any supervision. Theoretically, we provide a simple and tractable model for biases in zero-shot embeddings and give a result characterizing under what conditions our approach can boost…
Peer Reviews
Decision·ICLR 2024 poster
- The proposed ROBOSHOT method is an interesting novel approach that improves the robustness of zero-shot models against harmful concepts without the manual identification of harmful concepts. It leverages insights obtained from large language models to refine embeddings, and address inherited biases. I also find the theoretical arguments interesting which characterizes the conditions under which ROBOSHOT can outperform existing methods in zero shot learning. - The paper presents a well-stru
- No large scale datasets like imagenet - The benchmarks are limited to zero shot classification which is an easy task compared to zero-shot semantic segmentation and instance segmentation where this method could struggle. - I don't see this work as actual zero shot because the pretrained model has so much information about the classes present in the chosen datasets. This work would be more impactful if the experiments were conducted on rare classes to test whether this method generalizes well
- The overall motivation and idea of the paper is quite novel with interesting applications across various foundational models like CLIP and LLM. - The empirical results also are quite strong.
- $X_{proj}$ has not been defined in LFA section. - The authors should clarify how the basis vectors ($z$) are identified in the experiments, as the decomposition of insight vectors is based on that. - I understand that the proposed approach is poised to give major gains in class imbalance settings or well known setting with spurious features. However, I encourage the authors to also provide results in standard classification tasks like imagenet using CLIP. I fear that in many standard tasks, re
### Originality - The paper proposes novel methodology to improve the robustness of zero-shot models such as CLIP without fine-tuning or using extra data. ### Quality - The simplicity of the debiasing techniques makes it a cheap solution that can be easily employed by any practitioner. - Accepting the assumptions, the theoretical analysis makes intuitive sense. ### Clarity - The paper makes for a pleasing read: it is well written, easy to read and mostly clear. - The framework is clearly ex
- The assumption of concept embeddings being orthonormal doesn’t seem well motivated; do we expect the concept embedding of ‘waterbird’ to be orthogonal to ‘water’? I find the experiment in Appendix F.5 to be inconclusive due to the simplicity of the considered concepts, and I can’t immediately understand why the average of the images having a higher cosine similarity should give any insight on the decomposition of the space in harmful, helpful and neutral subspaces. - Unfortunately, the ove
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging Techniques and Applications · Advanced Radiotherapy Techniques
