Will Your Next Pair Programming Partner Be Human? An Empirical Evaluation of Generative AI as a Collaborative Teammate in a Semester-Long Classroom Setting
Wenhan Lyu, Yimeng Wang, Yifan Sun, Yixuan Zhang

TL;DR
This study empirically evaluates the impact of generative AI tools as collaborative partners in pair programming within a classroom, showing improved performance and changing student attitudes, while highlighting limitations and design considerations.
Contribution
It provides one of the first empirical comparisons of human and AI pair programming, exploring effects on learning, collaboration, and student perceptions in a semester-long setting.
Findings
Students with AI partners scored highest on assignments.
Attitudes towards AI's programming abilities improved after collaboration.
Students valued AI tools but noted limitations and differing expectations.
Abstract
Generative AI (GenAI), especially Large Language Models (LLMs), is rapidly reshaping both programming workflows and computer science education. Many programmers now incorporate GenAI tools into their workflows, including for collaborative coding tasks such as pair programming. While prior research has demonstrated the benefits of traditional pair programming and begun to explore GenAI-assisted coding, the role of LLM-based tools as collaborators in pair programming remains underexamined. In this work, we conducted a mixed-methods study with 39 undergraduate students to examine how GenAI influences collaboration, learning, and performance in pair programming. Specifically, students completed six in-class assignments under three conditions: Traditional Pair Programming (PP), Pair Programming with GenAI (PAI), and Solo Programming with GenAI (SAI). They used both LLM-based inline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
