Enhancing Automated Essay Scoring with Three Techniques: Two-Stage Fine-Tuning, Score Alignment, and Self-Training
Hongseok Choi, Serynn Kim, Wencke Liermann, Jin Seong, Jin-Xia Huang

TL;DR
This paper introduces three techniques—Two-Stage fine-tuning, Score Alignment, and Self-Training—to significantly improve automated essay scoring, especially in scenarios with limited labeled data, achieving near full-data performance with fewer samples.
Contribution
The paper presents a novel combination of three techniques to enhance AES performance, particularly under data scarcity, and demonstrates their effectiveness on the ASAP++ dataset.
Findings
All three techniques improve performance in limited-data settings.
Score Alignment enhances consistency and achieves state-of-the-art results.
Integration of techniques reaches 91.2% of full-data performance with 32 samples.
Abstract
Automated Essay Scoring (AES) plays a crucial role in education by providing scalable and efficient assessment tools. However, in real-world settings, the extreme scarcity of labeled data severely limits the development and practical adoption of robust AES systems. This study proposes a novel approach to enhance AES performance in both limited-data and full-data settings by introducing three key techniques. First, we introduce a Two-Stage fine-tuning strategy that leverages low-rank adaptations to better adapt an AES model to target prompt essays. Second, we introduce a Score Alignment technique to improve consistency between predicted and true score distributions. Third, we employ uncertainty-aware self-training using unlabeled data, effectively expanding the training set with pseudo-labeled samples while mitigating label noise propagation. We implement above three key techniques on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · Topic Modeling
