A Categorical Archive of ChatGPT Failures

Ali Borji

arXiv:2302.03494·cs.CL·April 5, 2023·94 cites

A Categorical Archive of ChatGPT Failures

Ali Borji

PDF

Open Access 1 Repo

TL;DR

This paper categorizes and analyzes the various failure modes of ChatGPT, highlighting its limitations and societal implications to guide future improvements in language models.

Contribution

It provides a comprehensive taxonomy of ChatGPT failures across eleven categories, which was previously lacking in the literature.

Findings

01

Identified eleven categories of ChatGPT failures

02

Discussed risks and societal implications of these failures

03

Provided insights to improve future language models

Abstract

Large language models have been demonstrated to be valuable in different fields. ChatGPT, developed by OpenAI, has been trained using massive amounts of data and simulates human conversation by comprehending context and generating appropriate responses. It has garnered significant attention due to its ability to effectively answer a broad range of human inquiries, with fluent and comprehensive answers surpassing prior public chatbots in both security and usefulness. However, a comprehensive analysis of ChatGPT's failures is lacking, which is the focus of this study. Eleven categories of failures, including reasoning, factual errors, math, coding, and bias, are presented and discussed. The risks, limitations, and societal implications of ChatGPT are also highlighted. The goal of this study is to assist researchers and developers in enhancing future language models and chatbots.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aliborji/chatgpt_failures
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling