Linguistic Features for Readability Assessment

Tovly Deutsch; Masoud Jasbi; Stuart Shieber

arXiv:2006.00377·cs.CL·August 4, 2020

TL;DR

This study tests whether adding linguistically motivated features to deep learning models improves readability assessment. It finds that, given sufficient training data, the added features yield no improvement, which suggests that deep models may already encode such features.

Contribution

The paper augments deep learning models with traditional linguistically motivated features and tests whether the combination improves readability assessment performance.
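
As a rough illustration of what pairing handcrafted features with a deep model can look like, the sketch below computes three simple surface features and appends them to a model's output vector, producing one input for a downstream classifier. The feature choice, the function names `linguistic_features` and `augment`, the placeholder probabilities, and the concatenation strategy are all assumptions made for illustration; this page does not specify the paper's feature set or how it joins features with model outputs.

```python
import re
from typing import List


def linguistic_features(text: str) -> List[float]:
    """Return three illustrative surface features (assumed, not the paper's set)."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text.lower())
    if not sentences or not words:
        return [0.0, 0.0, 0.0]
    avg_sentence_len = len(words) / len(sentences)        # words per sentence
    avg_word_len = sum(len(w) for w in words) / len(words)  # characters per word
    type_token_ratio = len(set(words)) / len(words)       # lexical diversity
    return [avg_sentence_len, avg_word_len, type_token_ratio]


def augment(model_output: List[float], text: str) -> List[float]:
    """Concatenate a deep model's output vector with the handcrafted features."""
    return list(model_output) + linguistic_features(text)


# Hypothetical per-level probabilities from a deep model (placeholder values).
deep_probs = [0.1, 0.7, 0.2]
combined = augment(deep_probs, "The cat sat. The dog ran fast.")
print(len(combined))  # 6
```

The combined vector could then be passed to any simple classifier. Comparing that classifier against the deep model alone is the kind of experiment the contribution describes.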

Findings

01

Given sufficient training data, augmenting deep models with linguistic features does not improve state-of-the-art performance.

02

Deep learning models may inherently learn linguistically relevant features.

03

Traditional features may be redundant in high-data regimes.

Abstract

Readability assessment aims to automatically classify text by the level appropriate for learning readers. Traditional approaches to this task utilize a variety of linguistically motivated features paired with simple machine learning models. More recent methods have improved performance by discarding these features and utilizing deep learning models. However, it is unknown whether augmenting deep learning models with linguistically motivated features would improve performance further. This paper combines these two approaches with the goal of improving overall model performance and addressing this question. Evaluating on two large readability corpora, we find that, given sufficient training data, augmenting deep learning models with linguistically motivated features does not improve state-of-the-art performance. Our results provide preliminary evidence for the hypothesis that the…

Code & Models

Repositories

TovlyDeutsch/Linguistic-Features-for-Readability
PyTorch · Official
