Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams

Aayam Bansal

arXiv:2602.18520·cs.CV·February 24, 2026

Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams

Aayam Bansal

PDF

Open Access

TL;DR

Sketch2Feedback introduces a structured framework that combines perception, symbolic reasoning, and language models to provide accurate, rubric-aligned feedback on STEM diagrams, reducing hallucinations and improving trustworthiness.

Contribution

The paper presents a novel grammar-in-the-loop pipeline that verifies diagram violations before language model verbalization, enhancing feedback accuracy and reliability in STEM education.

Findings

01

Qwen2-VL-7B achieves highest F1 but with high hallucination rates.

02

Ensemble oracle improves F1 and reduces hallucinations.

03

Grammar pipeline yields more actionable feedback than end-to-end models.

Abstract

Providing timely, rubric-aligned feedback on student-drawn diagrams is a persistent challenge in STEM education. While large multimodal models (LMMs) can jointly parse images and generate explanations, their tendency to hallucinate undermines trust in classroom deployments. We present Sketch2Feedback, a grammar-in-the-loop framework that decomposes the problem into four stages -- hybrid perception, symbolic graph construction, constraint checking, and constrained VLM feedback -- so that the language model verbalizes only violations verified by an upstream rule engine. We evaluate on two synthetic micro-benchmarks, FBD-10 (free-body diagrams) and Circuit-10 (circuit schematics), each with 500 images spanning standard and hard noise augmentation tiers, comparing our pipeline against end-to-end LMMs (LLaVA-1.5-7B, Qwen2-VL-7B), a vision-only detector, a YOLOv8-nano learned detector, and an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Science Education and Pedagogy · Model Reduction and Neural Networks