Stacking Neural Network Models for Automatic Short Answer Scoring

Rian Adam Rajagede; Rochana Prih Hastuti

arXiv:2010.11092·cs.CL·March 24, 2021

Stacking Neural Network Models for Automatic Short Answer Scoring

Rian Adam Rajagede, Rochana Prih Hastuti

PDF

TL;DR

This paper presents a stacking neural network and XGBoost model with data augmentation and hyperparameter tuning for improved automatic short answer scoring, achieving higher F1-score on a benchmark dataset.

Contribution

It introduces a novel stacking approach combined with data upsampling and hyperparameter optimization for short answer scoring.

Findings

01

Achieved an F1-score of 0.821 on Ukara 1.0 dataset

02

Outperformed previous models on the same dataset

03

Demonstrated effectiveness of data augmentation and hyperparameter tuning

Abstract

Automatic short answer scoring is one of the text classification problems to assess students' answers during exams automatically. Several challenges can arise in making an automatic short answer scoring system, one of which is the quantity and quality of the data. The data labeling process is not easy because it requires a human annotator who is an expert in their field. Further, the data imbalance process is also a challenge because the number of labels for correct answers is always much less than the wrong answers. In this paper, we propose the use of a stacking model based on neural network and XGBoost for classification process with sentence embedding feature. We also propose to use data upsampling method to handle imbalance classes and hyperparameters optimization algorithm to find a robust model automatically. We use Ukara 1.0 Challenge dataset and our best model obtained an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.