Data-Mining Textual Responses to Uncover Misconception Patterns
Joshua J. Michalenko, Andrew S. Lan, Richard G. Baraniuk

TL;DR
This paper introduces a natural language processing framework that automatically detects and classifies student misconceptions from textual responses, enabling scalable and targeted feedback in education.
Contribution
It presents a probabilistic model for analyzing student responses to identify common misconceptions without manual enumeration.
Findings
Framework outperforms baseline in classifying misconceptions
Automatically detects common misconceptions across responses
Scalable approach suitable for large classes
Abstract
An important, yet largely unstudied, problem in student data analysis is to detect misconceptions from students' responses to open-response questions. Misconception detection enables instructors to deliver more targeted feedback on the misconceptions exhibited by many students in their class, thus improving the quality of instruction. In this paper, we propose a new natural language processing-based framework to detect the common misconceptions among students' textual responses to short-answer questions. We propose a probabilistic model for students' textual responses involving misconceptions and experimentally validate it on a real-world student-response dataset. Experimental results show that our proposed framework excels at classifying whether a response exhibits one or more misconceptions. More importantly, it can also automatically detect the common misconceptions exhibited across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling · Software Engineering Research
