How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary   Investigation

Zhongyi Han; Guanglin Zhou; Rundong He; Jindong Wang; Tailin Wu,; Yilong Yin; Salman Khan; Lina Yao; Tongliang Liu; Kun Zhang

arXiv:2312.07424·cs.LG·February 27, 2024·1 cites

How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation

Zhongyi Han, Guanglin Zhou, Rundong He, Jindong Wang, Tailin Wu,, Yilong Yin, Salman Khan, Lina Yao, Tongliang Liu, Kun Zhang

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper evaluates GPT-4V(ision)'s ability to adapt to distribution shifts across diverse datasets and perturbations, revealing its strengths and limitations in robustness and generalization in dynamic environments.

Contribution

It provides a comprehensive benchmark of GPT-4V's zero-shot and perturbed data performance, highlighting its adaptability and identifying areas for improvement.

Findings

01

GPT-4V shows strong zero-shot performance on certain datasets.

02

Performance degrades under specific data perturbations.

03

In-context learning can enhance GPT-4V's adaptation capabilities.

Abstract

In machine learning, generalization against distribution shifts -- where deployment conditions diverge from the training scenarios -- is crucial, particularly in fields like climate modeling, biomedicine, and autonomous driving. The emergence of foundation models, distinguished by their extensive pretraining and task versatility, has led to an increased interest in their adaptability to distribution shifts. GPT-4V(ision) acts as the most advanced publicly accessible multimodal foundation model, with extensive applications across various domains, including anomaly detection, video understanding, image generation, and medical diagnosis. However, its robustness against data distributions remains largely underexplored. Addressing this gap, this study rigorously evaluates GPT-4V's adaptability and generalization capabilities in dynamic environments, benchmarking against prominent models like…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jameszhou-gl/gpt-4v-distribution-shift
noneOfficial

Datasets

jameszhou-gl/gpt-4v-distribution-shift
dataset· 99 dl
99 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Topic Modeling

MethodsContrastive Language-Image Pre-training