Predicting the Relative Difficulty of Single Sentences With and Without Surrounding Context
Elliot Schumacher, Maxine Eskenazi, Gwen Frishkoff, Kevyn, Collins-Thompson

TL;DR
This paper presents a new method for predicting the relative reading difficulty of sentences, considering the influence of surrounding context, which is crucial for applications like language tutoring systems.
Contribution
It introduces a novel approach combining lexical and grammatical features to assess sentence difficulty with and without context, using logistic regression and Bayesian ranking.
Findings
Contextual features improve difficulty prediction accuracy.
Models can differentiate difficulty based on surrounding passage influence.
Ranking consistency varies with context presence.
Abstract
The problem of accurately predicting relative reading difficulty across a set of sentences arises in a number of important natural language applications, such as finding and curating effective usage examples for intelligent language tutoring systems. Yet while significant research has explored document- and passage-level reading difficulty, the special challenges involved in assessing aspects of readability for single sentences have received much less attention, particularly when considering the role of surrounding passages. We introduce and evaluate a novel approach for estimating the relative reading difficulty of a set of sentences, with and without surrounding context. Using different sets of lexical and grammatical features, we explore models for predicting pairwise relative difficulty using logistic regression, and examine rankings generated by aggregating pairwise difficulty…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
