Stacking Neural Network Models for Automatic Short Answer Scoring
Rian Adam Rajagede, Rochana Prih Hastuti

TL;DR
This paper presents a stacking neural network and XGBoost model with data augmentation and hyperparameter tuning for improved automatic short answer scoring, achieving higher F1-score on a benchmark dataset.
Contribution
It introduces a novel stacking approach combined with data upsampling and hyperparameter optimization for short answer scoring.
Findings
Achieved an F1-score of 0.821 on Ukara 1.0 dataset
Outperformed previous models on the same dataset
Demonstrated effectiveness of data augmentation and hyperparameter tuning
Abstract
Automatic short answer scoring is one of the text classification problems to assess students' answers during exams automatically. Several challenges can arise in making an automatic short answer scoring system, one of which is the quantity and quality of the data. The data labeling process is not easy because it requires a human annotator who is an expert in their field. Further, the data imbalance process is also a challenge because the number of labels for correct answers is always much less than the wrong answers. In this paper, we propose the use of a stacking model based on neural network and XGBoost for classification process with sentence embedding feature. We also propose to use data upsampling method to handle imbalance classes and hyperparameters optimization algorithm to find a robust model automatically. We use Ukara 1.0 Challenge dataset and our best model obtained an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
