ChatGPT Prompting Cannot Estimate Predictive Uncertainty in High-Resource Languages
Martino Pelucchi, Matias Valdenegro-Toro

TL;DR
This study evaluates ChatGPT's performance in high-resource languages and finds it struggles to accurately estimate its answer confidence, often being overconfident across multiple languages and NLP tasks.
Contribution
It provides the first analysis of ChatGPT's confidence calibration in high-resource languages, revealing overconfidence issues and performance similarities among these languages.
Findings
ChatGPT performs similarly across the five high-resource languages.
It exhibits poor confidence calibration, often overestimating its answer accuracy.
ChatGPT never assigns low confidence levels to its responses.
Abstract
ChatGPT took the world by storm for its impressive abilities. Due to its release without documentation, scientists immediately attempted to identify its limits, mainly through its performance in natural language processing (NLP) tasks. This paper aims to join the growing literature regarding ChatGPT's abilities by focusing on its performance in high-resource languages and on its capacity to predict its answers' accuracy by giving a confidence level. The analysis of high-resource languages is of interest as studies have shown that low-resource languages perform worse than English in NLP tasks, but no study so far has analysed whether high-resource languages perform as well as English. The analysis of ChatGPT's confidence calibration has not been carried out before either and is critical to learn about ChatGPT's trustworthiness. In order to study these two aspects, five high-resource…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education
