Automated assessment of non-native learner essays: Investigating the   role of linguistic features

Sowmya Vajjala

arXiv:1612.00729·cs.CL·February 9, 2017

Automated assessment of non-native learner essays: Investigating the role of linguistic features

Sowmya Vajjala

PDF

1 Repo

TL;DR

This study investigates which linguistic features are most effective for automatic scoring of non-native English essays, revealing dataset-specific predictive features and demonstrating the importance of lexical, syntactic, discourse, and error features.

Contribution

It systematically evaluates the role of various linguistic features in AES across different datasets, highlighting dataset-specific predictive features and advancing understanding of feature importance.

Findings

01

Good predictive models achieved with the feature set

02

Most predictive features vary across datasets

03

Linguistic features include lexical, syntactic, discourse, and error types

Abstract

Automatic essay scoring (AES) refers to the process of scoring free text responses to given prompts, considering human grader scores as the gold standard. Writing such essays is an essential component of many language and aptitude exams. Hence, AES became an active and established area of research, and there are many proprietary systems used in real life applications today. However, not much is known about which specific linguistic features are useful for prediction and how much of this is consistent across datasets. This article addresses that by exploring the role of various linguistic features in automatic essay scoring using two publicly available datasets of non-native English essays written in test taking scenarios. The linguistic properties are modeled by encoding lexical, syntactic, discourse and error types of learner language in the feature set. Predictive models are then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://bitbucket.org/nishkalavallabhi/ijaiedpapercode
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.