Enhancing Automated Essay Scoring with Three Techniques: Two-Stage Fine-Tuning, Score Alignment, and Self-Training

Hongseok Choi; Serynn Kim; Wencke Liermann; Jin Seong; Jin-Xia Huang

arXiv:2602.01747·cs.CL·February 3, 2026

Enhancing Automated Essay Scoring with Three Techniques: Two-Stage Fine-Tuning, Score Alignment, and Self-Training

Hongseok Choi, Serynn Kim, Wencke Liermann, Jin Seong, Jin-Xia Huang

PDF

Open Access

TL;DR

This paper introduces three techniques—Two-Stage fine-tuning, Score Alignment, and Self-Training—to significantly improve automated essay scoring, especially in scenarios with limited labeled data, achieving near full-data performance with fewer samples.

Contribution

The paper presents a novel combination of three techniques to enhance AES performance, particularly under data scarcity, and demonstrates their effectiveness on the ASAP++ dataset.

Findings

01

All three techniques improve performance in limited-data settings.

02

Score Alignment enhances consistency and achieves state-of-the-art results.

03

Integration of techniques reaches 91.2% of full-data performance with 32 samples.

Abstract

Automated Essay Scoring (AES) plays a crucial role in education by providing scalable and efficient assessment tools. However, in real-world settings, the extreme scarcity of labeled data severely limits the development and practical adoption of robust AES systems. This study proposes a novel approach to enhance AES performance in both limited-data and full-data settings by introducing three key techniques. First, we introduce a Two-Stage fine-tuning strategy that leverages low-rank adaptations to better adapt an AES model to target prompt essays. Second, we introduce a Score Alignment technique to improve consistency between predicted and true score distributions. Third, we employ uncertainty-aware self-training using unlabeled data, effectively expanding the training set with pseudo-labeled samples while mitigating label noise propagation. We implement above three key techniques on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · Topic Modeling