Semi-Supervised Joint Estimation of Word and Document Readability

Yoshinari Fujinuma; Masato Hagiwara

arXiv:2104.13103·cs.CL·April 28, 2021

Semi-Supervised Joint Estimation of Word and Document Readability

Yoshinari Fujinuma, Masato Hagiwara

PDF

Open Access 1 Repo

TL;DR

This paper introduces a semi-supervised graph convolutional network approach to jointly estimate word and document readability, leveraging their recursive relationship for improved accuracy and robustness with limited labeled data.

Contribution

It presents a novel semi-supervised GCN method that jointly models word and document difficulty, outperforming existing baselines.

Findings

01

Higher accuracy than strong baselines

02

Robust performance with less labeled data

03

Effective joint estimation of word and document difficulty

Abstract

Readability or difficulty estimation of words and documents has been investigated independently in the literature, often assuming the existence of extensive annotated resources for the other. Motivated by our analysis showing that there is a recursive relationship between word and document difficulty, we propose to jointly estimate word and document difficulty through a graph convolutional network (GCN) in a semi-supervised fashion. Our experimental results reveal that the GCN-based method can achieve higher accuracy than strong baselines, and stays robust even with a smaller amount of labeled data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

akkikiki/diff_joint_estimate
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Digital Accessibility for Disabilities