TL;DR
This study evaluates GitHub's Copilot through a user study to understand its impact on code security, revealing potential security benefits especially for complex problems, but highlighting the need for further research.
Contribution
It provides the first user-centered security evaluation of Copilot, analyzing its effects on developer security performance across different problem complexities.
Findings
Copilot leads to more secure solutions on harder problems.
No significant impact of Copilot on security for easier problems.
Use of Copilot does not disproportionately affect vulnerability types.
Abstract
Code generation tools driven by artificial intelligence have recently become more popular due to advancements in deep learning and natural language processing that have increased their capabilities. The proliferation of these tools may be a double-edged sword because while they can increase developer productivity by making it easier to write code, research has shown that they can also generate insecure code. In this paper, we perform a user-centered evaluation GitHub's Copilot to better understand its strengths and weaknesses with respect to code security. We conduct a user study where participants solve programming problems (with and without Copilot assistance) that have potentially vulnerable solutions. The main goal of the user study is to determine how the use of Copilot affects participants' security performance. In our set of participants (n=25), we find that access to Copilot…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
