Beyond Functional Correctness: Design Issues in AI IDE-Generated Large-Scale Projects

Syed Mohammad Kashif; Ruiyin Li; Peng Liang; Amjed Tahir; Qiong Feng; Zengyang Li; Mojtaba Shahin

arXiv:2604.06373·cs.SE·April 9, 2026

Beyond Functional Correctness: Design Issues in AI IDE-Generated Large-Scale Projects

Syed Mohammad Kashif, Ruiyin Li, Peng Liang, Amjed Tahir, Qiong Feng, Zengyang Li, Mojtaba Shahin

PDF

1 Repo

TL;DR

This study evaluates AI IDEs' ability to generate large-scale projects, revealing high functional correctness but prevalent design issues affecting maintainability and adherence to best practices.

Contribution

It introduces the FD-HITL framework for guiding project generation and provides empirical analysis of design issues in AI-generated large-scale software.

Findings

01

AI IDEs can generate large projects with high functional correctness.

02

Generated projects contain significant design issues impacting maintainability.

03

Common issues include code duplication, high complexity, and violations of design principles.

Abstract

New generation of AI coding tools, including AI-powered IDEs equipped with agentic capabilities, can generate code within the context of the project. These AI IDEs are increasingly perceived as capable of producing project-level code at scale. However, there is limited empirical evidence on the extent to which they can generate large-scale software systems and what design issues such systems may exhibit. To address this gap, we conducted a study to explore the capability of Cursor in generating large-scale projects and to evaluate the design quality of projects generated by Cursor. First, we propose a Feature-Driven Human-In-The-Loop (FD-HITL) framework that systematically guides project generation from curated project descriptions. We generated 10 projects using Cursor with the FD-HITL framework across three application domains and multiple technologies. We assessed the functional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Kashifraz/DIinAGP
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.