Evaluating CLIP: Towards Characterization of Broader Capabilities and Downstream Implications
Sandhini Agarwal, Gretchen Krueger, Jack Clark, Alec Radford, Jong, Wook Kim, Miles Brundage

TL;DR
This paper critically evaluates CLIP, a generalizable computer vision model, highlighting its capabilities, biases, and implications for safe deployment, emphasizing the need for broader evaluation criteria beyond accuracy.
Contribution
It provides an in-depth analysis of CLIP's capabilities, biases, and deployment challenges, advocating for broader safety and fairness considerations in model evaluation.
Findings
CLIP reduces task-specific training data needs.
CLIP's natural language class specification can influence bias manifestation.
CLIP inherits biases from prior vision systems.
Abstract
Recently, there have been breakthroughs in computer vision ("CV") models that are more generalizable with the advent of models such as CLIP and ALIGN. In this paper, we analyze CLIP and highlight some of the challenges such models pose. CLIP reduces the need for task specific training data, potentially opening up many niche tasks to automation. CLIP also allows its users to flexibly specify image classification classes in natural language, which we find can shift how biases manifest. Additionally, through some preliminary probes we find that CLIP can inherit biases found in prior computer vision systems. Given the wide and unpredictable domain of uses for such models, this raises questions regarding what sufficiently safe behaviour for such systems may look like. These results add evidence to the growing body of work calling for a change in the notion of a 'better' model--to move beyond…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsALIGN · Contrastive Language-Image Pre-training
