Buggy rule diagnosis for combined steps through final answer evaluation in stepwise tasks
Gerben van der Hoek, Johan Jeuring, Rogier Bos

TL;DR
This paper proposes a method for diagnosing errors in stepwise tasks by evaluating the final answer, effectively handling combined steps and reducing diagnostic complexity in intelligent tutoring systems.
Contribution
It introduces a final answer-based error diagnosis approach for combined steps, validated on quadratic equation solving data, showing high accuracy and potential for improved tutoring support.
Findings
Final answer evaluation diagnosed 29.4% of complex steps.
Diagnosis aligned with teachers in 97% of cases.
Approach reduces combinatorial explosion in error diagnosis.
Abstract
Many intelligent tutoring systems can support a student in solving a stepwise task. When a student combines several steps in one step, the number of possible paths connecting consecutive inputs may be very large. This combinatorial explosion makes error diagnosis hard. Using a final answer to diagnose a combination of steps can mitigate the combinatorial explosion, because there are generally fewer possible (erroneous) final answers than (erroneous) solution paths. An intermediate input for a task can be diagnosed by automatically completing it according to the task solution strategy and diagnosing this solution. This study explores the potential of automated error diagnosis based on a final answer. We investigate the design of a service that provides a buggy rule diagnosis when a student combines several steps. To validate the approach, we apply the service to an existing dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Teaching and Learning Programming · Educational Technology and Assessment
