H-AES: Towards Automated Essay Scoring for Hindi

Shubhankar Singh; Anirudh Pupneja; Shivaansh Mital; Cheril Shah,; Manish Bawkar; Lakshman Prasad Gupta; Ajit Kumar; Yaman Kumar; Rushali Gupta,; Rajiv Ratn Shah

arXiv:2302.14635·cs.CL·March 1, 2023·1 cites

H-AES: Towards Automated Essay Scoring for Hindi

Shubhankar Singh, Anirudh Pupneja, Shivaansh Mital, Cheril Shah,, Manish Bawkar, Lakshman Prasad Gupta, Ajit Kumar, Yaman Kumar, Rushali Gupta,, Rajiv Ratn Shah

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explores automated essay scoring for Hindi using classical and advanced NLP models, demonstrating comparable performance to English AES despite Hindi's low-resource status.

Contribution

It adapts and evaluates state-of-the-art AES methods for Hindi, a low-resource language, using translated data and real-world corpus analysis.

Findings

01

Models achieve performance comparable to English AES.

02

End-to-end models outperform classical feature-based methods.

03

Prompt-specific behaviors influence model performance.

Abstract

The use of Natural Language Processing (NLP) for Automated Essay Scoring (AES) has been well explored in the English language, with benchmark models exhibiting performance comparable to human scorers. However, AES in Hindi and other low-resource languages remains unexplored. In this study, we reproduce and compare state-of-the-art methods for AES in the Hindi domain. We employ classical feature-based Machine Learning (ML) and advanced end-to-end models, including LSTM Networks and Fine-Tuned Transformer Architecture, in our approach and derive results comparable to those in the English language domain. Hindi being a low-resource language, lacks a dedicated essay-scoring corpus. We train and evaluate our models using translated English essays and empirically measure their performance on our own small-scale, real-world Hindi corpus. We follow this up with an in-depth analysis discussing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

midas-research/hindi-aes
noneOfficial

Videos

H-AES: Towards Automated Essay Scoring for Hindi· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Tanh Activation · Label Smoothing · Softmax · Sigmoid Activation · Adam · Layer Normalization