User Misconceptions of LLM-Based Conversational Programming Assistants
Gabrielle O'Brien, Antonio Pedro Santos Alves, Sebastian Baltes, Grischa Liebel, Mircea Lungu, Marcos Kalinowski

TL;DR
This paper investigates user misconceptions about LLM-based programming assistants, highlighting the need for clearer communication of tool capabilities to prevent over-reliance and improve programming practices.
Contribution
It provides a detailed characterization of misconceptions through qualitative analysis of real user conversations, emphasizing the importance of transparent tool communication.
Findings
Users have misplaced expectations about web access and code execution.
Misconceptions can lead to over-reliance and unproductive practices.
Deeper issues involve understanding debugging and validation information.
Abstract
Programming assistants powered by large language models (LLMs) have become widely available, with conversational assistants like ChatGPT particularly accessible to novice programmers. However, varied tool capabilities and inconsistent availability of extensions (web search, code execution, retrieval-augmented generation) create opportunities for user misconceptions that may lead to over-reliance, unproductive practices, or insufficient quality control. We characterize misconceptions that users of conversational LLM-based assistants may have in programming contexts through a two-phase approach: first brainstorming and cataloging potential misconceptions, then conducting qualitative analysis of Python-programming conversations from the WildChat dataset. We find evidence that users have misplaced expectations about features like web access, code execution, and non-text outputs. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
