Individual and Combined Effects of English as a Second Language and Typos on LLM Performance
Serena Liu, Yutong Yang, Prisha Sheth, Weixuan Dong, Mingjiao Diao, Xinru Zhu, Nikhil Banga, Oscar Melendez, Arnav Sharma, Minda Zhao, Marina Lin, and Mengyu Wang

TL;DR
This paper investigates how ESL variations and typos jointly affect large language model performance, revealing that their combined impact often exceeds individual effects, especially on closed-ended tasks.
Contribution
It introduces a framework to systematically analyze the combined effects of ESL and typos on LLMs, highlighting the importance of realistic evaluation scenarios.
Findings
Combined ESL and typos cause larger performance drops than either factor alone.
Performance degradation is more consistent on closed-ended tasks.
Evaluations on clean English may overestimate real-world model performance.
Abstract
Large language models (LLMs) are used globally, and because much of their training data is in English, they typically perform best on English inputs. As a result, many non-native English speakers interact with them in English as a second language (ESL), and these inputs often contain typographical errors. Prior work has largely studied the effects of ESL variation and typographical errors separately, even though they often co-occur in real-world use. In this study, we use the Trans-EnV framework to transform standard English inputs into eight ESL variants and apply MulTypo to inject typos at three levels: low, moderate, and severe. We find that combining ESL variation and typos generally leads to larger performance drops than either factor alone, though the combined effect is not simply additive. This pattern is clearest on closed-ended tasks, where performance degradation can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
