An Empirical Study of Proactive Coding Assistants in Real-World Software Development
Lehui Li, Ruixuan Jia, Guo-Ye Yang, and Jia Li

TL;DR
This study compares real developer IDE interactions with LLM-simulated traces, revealing significant differences and emphasizing the need for real data in developing effective proactive coding assistants.
Contribution
It provides the first large-scale empirical analysis of real IDE traces versus simulated data and introduces ProCodeBench, a benchmark for proactive intent prediction.
Findings
Simulated IDE traces differ significantly from real traces in diversity and structure.
Current LLM-based approaches perform poorly on real IDE data.
Simulated data can complement real data but cannot replace it for training.
Abstract
Large language model (LLM)-based coding assistants have made substantial progress, yet most systems remain reactive, requiring developers to explicitly formulate their needs. Proactive coding assistants aim to infer latent developer intent from integrated development environment (IDE) interactions and repository context, thereby reducing interaction overhead and supporting more seamless assistance. However, research in this direction is limited by the scarcity of large-scale real-world developer behavior data. Existing studies therefore often rely on LLM-simulated IDE traces, whose fidelity to real development behavior remains unclear. In this paper, we investigate this simulation-to-reality gap through a large-scale empirical study. We collect real IDE interaction traces from 1{,}246 experienced industry developers over three consecutive days using a custom Visual Studio Code…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
