Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams
Aayam Bansal

TL;DR
Sketch2Feedback introduces a structured framework that combines perception, symbolic reasoning, and language models to provide accurate, rubric-aligned feedback on STEM diagrams, reducing hallucinations and improving trustworthiness.
Contribution
The paper presents a novel grammar-in-the-loop pipeline that verifies diagram violations before language model verbalization, enhancing feedback accuracy and reliability in STEM education.
Findings
Qwen2-VL-7B achieves highest F1 but with high hallucination rates.
Ensemble oracle improves F1 and reduces hallucinations.
Grammar pipeline yields more actionable feedback than end-to-end models.
Abstract
Providing timely, rubric-aligned feedback on student-drawn diagrams is a persistent challenge in STEM education. While large multimodal models (LMMs) can jointly parse images and generate explanations, their tendency to hallucinate undermines trust in classroom deployments. We present Sketch2Feedback, a grammar-in-the-loop framework that decomposes the problem into four stages -- hybrid perception, symbolic graph construction, constraint checking, and constrained VLM feedback -- so that the language model verbalizes only violations verified by an upstream rule engine. We evaluate on two synthetic micro-benchmarks, FBD-10 (free-body diagrams) and Circuit-10 (circuit schematics), each with 500 images spanning standard and hard noise augmentation tiers, comparing our pipeline against end-to-end LMMs (LLaVA-1.5-7B, Qwen2-VL-7B), a vision-only detector, a YOLOv8-nano learned detector, and an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Science Education and Pedagogy · Model Reduction and Neural Networks
