A Comparative Study of Technical Writing Feedback Quality: Evaluating LLMs, SLMs, and Humans in Computer Science Topics
Suqing Liu, Bogdan Simion, Christopher Eaton, Michael Liut

TL;DR
This study compares the quality of feedback from Large Language Models, Small Language Models, and humans across different computer science courses, highlighting strengths, limitations, and scalability of AI-generated feedback.
Contribution
It provides a comprehensive evaluation of AI versus human feedback in technical writing, emphasizing the potential of hybrid approaches for scalable, high-quality educational feedback.
Findings
AI models deliver clear, actionable feedback in large courses.
Humans provide more nuanced, personalized guidance.
AI feedback is effective at scale but less personalized.
Abstract
Feedback is a critical component of the learning process, particularly in computer science education. This study investigates the quality of feedback generated by Large Language Models (LLMs), Small Language Models (SLMs), compared with human feedback, in three computer science course with technical writing components: an introductory computer science course (CS2), a third-year advanced systems course (operating systems), and a third-year writing course (a topics course on artificial intelligence). Using a mixed-methods approach which integrates quantitative Likert-scale questions with qualitative commentary, we analyze the student perspective on feedback quality, evaluated based on multiple criteria, including readability, detail, specificity, actionability, helpfulness, and overall quality. The analysis reveals that in the larger upper-year operating systems course (), SLMs and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStudent Assessment and Feedback · Teaching and Learning Programming · Educational Assessment and Pedagogy
