Closing the Gap: A User Study on the Real-world Usefulness of AI-powered Vulnerability Detection & Repair in the IDE
Benjamin Steenhoek, Kalpathy Sivaraman, Renata Saldivar Gonzalez,, Yevhen Mohylevskyy, Roshanak Zilouchian Moghaddam, and Wei Le

TL;DR
This study evaluates an AI-powered vulnerability detection and repair tool in real-world software development, revealing its potential and current limitations through a user study with professional developers.
Contribution
First empirical evaluation of an IDE-integrated vulnerability detection and fix tool with professional developers on real projects, providing insights into practical usability and challenges.
Findings
High false positive rate limits practicality
User feedback highlights need for better context and customization
AI features like explanations and chat improve user trust
Abstract
This paper presents the first empirical study of a vulnerability detection and fix tool with professional software developers on real projects that they own. We implemented DeepVulGuard, an IDE-integrated tool based on state-of-the-art detection and fix models, and show that it has promising performance on benchmarks of historic vulnerability data. DeepVulGuard scans code for vulnerabilities (including identifying the vulnerability type and vulnerable region of code), suggests fixes, provides natural-language explanations for alerts and fixes, leveraging chat interfaces. We recruited 17 professional software developers at Microsoft, observed their usage of the tool on their code, and conducted interviews to assess the tool's usefulness, speed, trust, relevance, and workflow integration. We also gathered detailed qualitative feedback on users' perceptions and their desired features.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Network Security and Intrusion Detection · Information and Cyber Security
