Exploring the Effect of Multiple Natural Languages on Code Suggestion Using GitHub Copilot
Kei Koyanagi, Dong Wang, Kotaro Noguchi, Masanari Kondo, Alexander, Serebrenik, Yasutaka Kamei, Naoyasu Ubayashi

TL;DR
This study investigates how different natural languages (English, Japanese, Chinese) influence GitHub Copilot's code suggestion performance, revealing language-dependent variations and the impact of question difficulty.
Contribution
It provides the first empirical analysis of natural language effects on Copilot, highlighting language bias and performance decline with increasing question difficulty.
Findings
Chinese performs the worst among the three languages.
Performance drops significantly as question difficulty increases.
Natural language influences Copilot's code suggestion effectiveness.
Abstract
GitHub Copilot is an AI-enabled tool that automates program synthesis. It has gained significant attention since its launch in 2021. Recent studies have extensively examined Copilot's capabilities in various programming tasks, as well as its security issues. However, little is known about the effect of different natural languages on code suggestion. Natural language is considered a social bias in the field of NLP, and this bias could impact the diversity of software engineering. To address this gap, we conducted an empirical study to investigate the effect of three popular natural languages (English, Japanese, and Chinese) on Copilot. We used 756 questions of varying difficulty levels from AtCoder contests for evaluation purposes. The results highlight that the capability varies across natural languages, with Chinese achieving the worst performance. Furthermore, regardless of the type…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTeaching and Learning Programming
