Mitigating Bias in Automated Grading Systems for ESL Learners: A Contrastive Learning Approach

Kevin Fan; Eric Yun

arXiv:2601.16724·cs.CL·January 26, 2026

Mitigating Bias in Automated Grading Systems for ESL Learners: A Contrastive Learning Approach

Kevin Fan, Eric Yun

PDF

Open Access

TL;DR

This paper addresses bias in automated ESL essay scoring by introducing a contrastive learning method that reduces scoring disparities between native and ESL writers, improving fairness without sacrificing accuracy.

Contribution

It proposes a novel contrastive learning approach with matched essay pairs to mitigate bias in transformer-based AES models for ESL learners.

Findings

01

Reduced ESL-native scoring gap by 39.9%

02

Maintained high model agreement with QWK of 0.76

03

Disentangled sentence complexity from grammatical errors

Abstract

As Automated Essay Scoring (AES) systems are increasingly used in high-stakes educational settings, concerns regarding algorithmic bias against English as a Second Language (ESL) learners have increased. Current Transformer-based regression models trained primarily on native-speaker corpora often learn spurious correlations between surface-level L2 linguistic features and essay quality. In this study, we conduct a bias study of a fine-tuned DeBERTa-v3 model using the ASAP 2.0 and ELLIPSE datasets, revealing a constrained score scaling for high-proficiency ESL writing where high-proficiency ESL essays receive scores 10.3% lower than Native speaker essays of identical human-rated quality. To mitigate this, we propose applying contrastive learning with a triplet construction strategy: Contrastive Learning with Matched Essay Pairs. We constructed a dataset of 17,161 matched essay pairs and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Topic Modeling · Second Language Acquisition and Learning