Empirical Analysis of the Effect of Context in the Task of Automated Essay Scoring in Transformer-Based Models
Abhirup Chakravarty

TL;DR
This paper investigates how incorporating various contextual factors into transformer-based models improves automated essay scoring, achieving higher accuracy and demonstrating the value of contextual enrichment over architecture modifications.
Contribution
It introduces a contextual augmentation approach for transformer-based AES models, significantly enhancing their performance across multiple essay sets.
Findings
Achieved a mean Quadratic Weighted Kappa score of 0.823 on the full dataset.
Outperformed prior transformer models in most essay sets.
Only 3.83% below the state-of-the-art deep learning model trained per essay set.
Abstract
Automated Essay Scoring (AES) has emerged to prominence in response to the growing demand for educational automation. Providing an objective and cost-effective solution, AES standardises the assessment of extended responses. Although substantial research has been conducted in this domain, recent investigations reveal that alternative deep-learning architectures outperform transformer-based models. Despite the successful dominance in the performance of the transformer architectures across various other tasks, this discrepancy has prompted a need to enrich transformer-based AES models through contextual enrichment. This study delves into diverse contextual factors using the ASAP-AES dataset, analysing their impact on transformer-based model performance. Our most effective model, augmented with multiple contextual dimensions, achieves a mean Quadratic Weighted Kappa score of 0.823 across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
