Scaling Equitable Reflection Assessment in Education via Large Language Models and Role-Based Feedback Agents
Chenyu Zhang, Xiaohang Luo

TL;DR
This paper introduces a multi-agent LLM system that provides scalable, equitable formative feedback in education by combining role-based agents for scoring, bias checking, metacognitive prompting, and feedback generation, improving support for diverse learners.
Contribution
The paper presents a novel multi-agent LLM framework with role-based agents that produce equitable, high-quality formative feedback at scale, addressing resource limitations in education.
Findings
System achieves expert-level rubric scoring accuracy.
AI-generated comments are rated as helpful and empathetic.
Fairness checks help monitor and reduce scoring disparities.
Abstract
Formative feedback is widely recognized as one of the most effective drivers of student learning, yet it remains difficult to implement equitably at scale. In large or low-resource courses, instructors often lack the time, staffing, and bandwidth required to review and respond to every student reflection, creating gaps in support precisely where learners would benefit most. This paper presents a theory-grounded system that uses five coordinated role-based LLM agents (Evaluator, Equity Monitor, Metacognitive Coach, Aggregator, and Reflexion Reviewer) to score learner reflections with a shared rubric and to generate short, bias-aware, learner-facing comments. The agents first produce structured rubric scores, then check for potentially biased or exclusionary language, add metacognitive prompts that invite students to think about their own thinking, and finally compose a concise feedback…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Innovative Teaching and Learning Methods · Student Assessment and Feedback
