How Natural Language Proficiency Shapes GenAI Code for Software Engineering Tasks
Ruksit Rojpaisarnkit, Youmei Fan, Kenichi Matsumoto, and Raula Gaikovina Kula

TL;DR
This study explores how English language proficiency in prompts influences the quality and correctness of code generated by large language models in software engineering, revealing that higher proficiency leads to more accurate outputs.
Contribution
It systematically analyzes the impact of natural language proficiency on code generation quality, highlighting its significance beyond prompt structure in LLM-based software engineering.
Findings
Higher language proficiency prompts improve code correctness across models
LLMs tend to default to an intermediate (B2) English level
Proficiency impacts code quality more than prompt structure alone
Abstract
With the widespread adoption of Foundation Model (FM)-powered tools in software engineering, the natural language prompt has become a critical interface between developers and Large Language Models (LLMs). While much research has focused on prompt structure, the natural language proficiency is an underexplored factor that can influence the quality of generated code. This paper investigates whether the English language proficiency itself independent of the prompting technique affects the proficiency and correctness of code generated by LLMs. Using the HumanEval dataset, we systematically varied the English proficiency of prompts from basic to advanced for 164 programming tasks and measured the resulting code proficiency and correctness. Our findings show that LLMs default to an intermediate (B2) natural language level. While the effect on the resulting code proficiency was…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Software Testing and Debugging Techniques
