Exploring Data Augmentation and Resampling Strategies for Transformer-Based Models to Address Class Imbalance in AI Scoring of Scientific Explanations in NGSS Classroom
Prudence Djagba, Kevin Haudek, Clare G.C. Franovic, Leonora Kaldaras

TL;DR
This study explores augmentation strategies like GPT-4, EASE, and ALP to improve transformer-based models for classifying student scientific explanations, effectively addressing class imbalance in automated scoring.
Contribution
It introduces novel augmentation methods that significantly enhance model performance and class balance in automated scoring of scientific explanations.
Findings
GPT-4 data improved precision and recall.
ALP achieved near-perfect scores across categories.
Augmentation strategies outperformed traditional oversampling methods.
Abstract
Automated scoring of students' scientific explanations offers the potential for immediate, accurate feedback, yet class imbalance in rubric categories particularly those capturing advanced reasoning remains a challenge. This study investigates augmentation strategies to improve transformer-based text classification of student responses to a physical science assessment based on an NGSS-aligned learning progression. The dataset consists of 1,466 high school responses scored on 11 binary-coded analytic categories. This rubric identifies six important components including scientific ideas needed for a complete explanation along with five common incomplete or inaccurate ideas. Using SciBERT as a baseline, we applied fine-tuning and test these augmentation strategies: (1) GPT-4--generated synthetic responses, (2) EASE, a word-level extraction and filtering approach, and (3) ALP (Augmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
