A Neural Pairwise Ranking Model for Readability Assessment

Justin Lee; Sowmya Vajjala

arXiv:2203.07450·cs.CL·March 16, 2022

A Neural Pairwise Ranking Model for Readability Assessment

Justin Lee, Sowmya Vajjala

PDF

Open Access 1 Repo

TL;DR

This paper introduces the first neural pairwise ranking model for automatic readability assessment, demonstrating strong monolingual and cross-lingual performance, and releasing a new bilingual dataset.

Contribution

It presents a novel neural pairwise ranking approach for ARA and provides the first cross-lingual zero-shot evaluation results.

Findings

01

Achieves over 80% cross-lingual ranking accuracy in zero-shot settings.

02

Performs well in monolingual and cross-corpus testing scenarios.

03

Introduces a new English-French bilingual readability dataset.

Abstract

Automatic Readability Assessment (ARA), the task of assigning a reading level to a text, is traditionally treated as a classification problem in NLP research. In this paper, we propose the first neural, pairwise ranking approach to ARA and compare it with existing classification, regression, and (non-neural) ranking methods. We establish the performance of our model by conducting experiments with three English, one French and one Spanish datasets. We demonstrate that our approach performs well in monolingual single/cross corpus testing scenarios and achieves a zero-shot cross-lingual ranking accuracy of over 80% for both French and Spanish when trained on English data. Additionally, we also release a new parallel bilingual readability dataset in English and French. To our knowledge, this paper proposes the first neural pairwise ranking model for ARA, and shows the first results of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jlee118/nprm
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques