A Study on Developer Behaviors for Validating and Repairing LLM-Generated Code Using Eye Tracking and IDE Actions
Ningzhi Tang, Meng Chen, Zheng Ning, Aakash Bansal, Yu Huang, Collin, McMillan, Toby Jia-Jun Li

TL;DR
This study explores how developers validate and repair LLM-generated code, revealing that provenance awareness influences their strategies, attention, and cognitive workload during the process.
Contribution
It provides empirical insights into developer behaviors with LLM-generated code and highlights the impact of code provenance awareness on validation and repair strategies.
Findings
Developers often fail to identify LLM origin without explicit info.
Provenance awareness improves performance and increases search efforts.
Developers exhibit distinct behaviors like switching between code and comments.
Abstract
The increasing use of large language model (LLM)-powered code generation tools, such as GitHub Copilot, is transforming software engineering practices. This paper investigates how developers validate and repair code generated by Copilot and examines the impact of code provenance awareness during these processes. We conducted a lab study with 28 participants, who were tasked with validating and repairing Copilot-generated code in three software projects. Participants were randomly divided into two groups: one informed about the provenance of LLM-generated code and the other not. We collected data on IDE interactions, eye-tracking, cognitive workload assessments, and conducted semi-structured interviews. Our results indicate that, without explicit information, developers often fail to identify the LLM origin of the code. Developers generally employ similar validation and repair strategies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovation in Digital Healthcare Systems
