Optimizing In-Context Demonstrations for LLM-based Automated Grading

Yucheng Chu; Hang Li; Kaiqi Yang; Yasemin Copur-Gencturk; Kevin Haudek; Joseph Krajcik; Jiliang Tang

arXiv:2603.00465·cs.AI·March 3, 2026

Optimizing In-Context Demonstrations for LLM-based Automated Grading

Yucheng Chu, Hang Li, Kaiqi Yang, Yasemin Copur-Gencturk, Kevin Haudek, Joseph Krajcik, Jiliang Tang

PDF

Open Access

TL;DR

This paper introduces GUIDE, a novel framework that optimizes exemplar selection and rationale generation for LLM-based grading, significantly improving accuracy especially on borderline cases by focusing on rubric boundaries.

Contribution

GUIDE reframes exemplar selection as a boundary-focused optimization, employing contrastive operators and discriminative rationales to enhance LLM grading accuracy.

Findings

01

Outperforms standard retrieval baselines across multiple datasets.

02

Shows robust improvements on borderline and boundary cases.

03

Enhances rubric adherence and grading reliability.

Abstract

Automated assessment of open-ended student responses is a critical capability for scaling personalized feedback in education. While large language models (LLMs) have shown promise in grading tasks via in-context learning (ICL), their reliability is heavily dependent on the selection of few-shot exemplars and the construction of high-quality rationales. Standard retrieval methods typically select examples based on semantic similarity, which often fails to capture subtle decision boundaries required for rubric adherence. Furthermore, manually crafting the expert rationales needed to guide these models can be a significant bottleneck. To address these limitations, we introduce GUIDE (Grading Using Iteratively Designed Exemplars), a framework that reframes exemplar selection and refinement in automated grading as a boundary-focused optimization problem. GUIDE operates on a continuous loop…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Topic Modeling · Online Learning and Analytics