Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural   Scrambled Text

Qi Cao; Takeshi Kojima; Yutaka Matsuo; Yusuke Iwasawa

arXiv:2311.18805·cs.CL·December 1, 2023·1 cites

Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text

Qi Cao, Takeshi Kojima, Yutaka Matsuo, Yusuke Iwasawa

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that GPT-4 exhibits near-perfect ability to correct and understand extensively scrambled text, revealing remarkable resilience of LLMs to unnatural input errors, unlike other models.

Contribution

The study introduces the Scrambled Bench to evaluate LLMs' handling of scrambled input and shows GPT-4's exceptional error correction capabilities under extreme conditions.

Findings

01

GPT-4 reduces edit distance by 95% on scrambled sentences

02

Most LLMs perform poorly on scrambled input, except GPT-4

03

GPT-4 nearly flawlessly reconstructs original sentences from scrambled text

Abstract

While Large Language Models (LLMs) have achieved remarkable performance in many tasks, much about their inner workings remains unclear. In this study, we present novel experimental insights into the resilience of LLMs, particularly GPT-4, when subjected to extensive character-level permutations. To investigate this, we first propose the Scrambled Bench, a suite designed to measure the capacity of LLMs to handle scrambled input, in terms of both recovering scrambled sentences and answering questions given scrambled context. The experimental results indicate that most powerful LLMs demonstrate the capability akin to typoglycemia, a phenomenon where humans can understand the meaning of words even when the letters within those words are scrambled, as long as the first and last letters remain in place. More surprisingly, we found that only GPT-4 nearly flawlessly processes inputs with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ccqq77/unnatural-error-correction
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Residual Connection · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Layer Normalization