How to Teach Programming in the AI Era? Using LLMs as a Teachable Agent for Debugging
Qianou Ma, Hua Shen, Kenneth Koedinger, Tongshuang Wu

TL;DR
This paper presents HypoCompass, a system that uses LLMs as teachable agents to enhance programming debugging skills in students by enabling a collaborative learning environment focused on debugging and hypothesis generation.
Contribution
It introduces a novel system that leverages LLMs as teachable agents for debugging, improving training material quality and student debugging performance.
Findings
HypoCompass outperforms human counterparts fourfold in efficiency.
Students' debugging performance improved by 12% after using HypoCompass.
The system effectively delegates tasks between students and LLMs for better learning.
Abstract
Large Language Models (LLMs) now excel at generative skills and can create content at impeccable speeds. However, they are imperfect and still make various mistakes. In a Computer Science education context, as these models are widely recognized as "AI pair programmers," it becomes increasingly important to train students on evaluating and debugging the LLM-generated code. In this work, we introduce HypoCompass, a novel system to facilitate deliberate practice on debugging, where human novices play the role of Teaching Assistants and help LLM-powered teachable agents debug code. We enable effective task delegation between students and LLMs in this learning-by-teaching environment: students focus on hypothesizing the cause of code errors, while adjacent skills like code completion are offloaded to LLM-agents. Our evaluations demonstrate that HypoCompass generates high-quality training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Software Engineering Research · Online Learning and Analytics
