On the Robustness of ChatGPT: An Adversarial and Out-of-distribution   Perspective

Jindong Wang; Xixu Hu; Wenxin Hou; Hao Chen; Runkai Zheng; Yidong; Wang; Linyi Yang; Haojun Huang; Wei Ye; Xiubo Geng; Binxin Jiao; Yue Zhang,; Xing Xie

arXiv:2302.12095·cs.AI·August 30, 2023·90 cites

On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective

Jindong Wang, Xixu Hu, Wenxin Hou, Hao Chen, Runkai Zheng, Yidong, Wang, Linyi Yang, Haojun Huang, Wei Ye, Xiubo Geng, Binxin Jiao, Yue Zhang,, Xing Xie

PDF

Open Access 1 Repo

TL;DR

This paper evaluates ChatGPT's robustness against adversarial and out-of-distribution inputs, revealing strengths in some tasks but also significant vulnerabilities, and discusses future research directions for improving foundation models.

Contribution

It provides a comprehensive robustness assessment of ChatGPT from adversarial and OOD perspectives, highlighting its advantages and remaining challenges.

Findings

01

ChatGPT outperforms baselines on most adversarial and OOD tasks.

02

Performance is still far from perfect, indicating robustness issues.

03

ChatGPT excels in understanding dialogue but offers informal medical suggestions.

Abstract

ChatGPT is a recent chatbot service released by OpenAI and is receiving increasing attention over the past few months. While evaluations of various aspects of ChatGPT have been done, its robustness, i.e., the performance to unexpected inputs, is still unclear to the public. Robustness is of particular concern in responsible AI, especially for safety-critical applications. In this paper, we conduct a thorough evaluation of the robustness of ChatGPT from the adversarial and out-of-distribution (OOD) perspective. To do so, we employ the AdvGLUE and ANLI benchmarks to assess adversarial robustness and the Flipkart review and DDXPlus medical diagnosis datasets for OOD evaluation. We select several popular foundation models as baselines. Results show that ChatGPT shows consistent advantages on most adversarial and OOD classification and translation tasks. However, the absolute performance is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/robustlearn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI)

Methodstravel james