Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation
Tung Phung, Victor-Alexandru P\u{a}durean, Anjali Singh, Christopher, Brooks, Jos\'e Cambronero, Sumit Gulwani, Adish Singla, Gustavo Soares

TL;DR
This paper presents a novel approach using GPT-4 to generate programming hints and GPT-3.5 to validate their quality, significantly improving automated feedback for programming students.
Contribution
It introduces GPT4Hints-GPT3.5Val, a new technique that leverages two different GPT models for high-quality hint generation and validation in programming education.
Findings
GPT-4 generates more accurate hints using symbolic info.
GPT-3.5 effectively validates hint quality automatically.
The approach outperforms existing models on real-world datasets.
Abstract
Generative AI and large language models hold great promise in enhancing programming education by automatically generating individualized feedback for students. We investigate the role of generative AI models in providing human tutor-style programming hints to help students resolve errors in their buggy programs. Recent works have benchmarked state-of-the-art models for various feedback generation scenarios; however, their overall quality is still inferior to human tutors and not yet ready for real-world deployment. In this paper, we seek to push the limits of generative AI models toward providing high-quality programming hints and develop a novel technique, GPT4Hints-GPT3.5Val. As a first step, our technique leverages GPT-4 as a ``tutor'' model to generate hints -- it boosts the generative quality by using symbolic information of failing test cases and fixes in prompts. As a next step,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning and Data Classification · Oil and Gas Production Techniques
Methods{Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Cosine Annealing · Label Smoothing · Absolute Position Encodings · Layer Normalization · Softmax · Dropout
