Semi-Supervised Joint Estimation of Word and Document Readability
Yoshinari Fujinuma, Masato Hagiwara

TL;DR
This paper introduces a semi-supervised graph convolutional network approach to jointly estimate word and document readability, leveraging their recursive relationship for improved accuracy and robustness with limited labeled data.
Contribution
It presents a novel semi-supervised GCN method that jointly models word and document difficulty, outperforming existing baselines.
Findings
Higher accuracy than strong baselines
Robust performance with less labeled data
Effective joint estimation of word and document difficulty
Abstract
Readability or difficulty estimation of words and documents has been investigated independently in the literature, often assuming the existence of extensive annotated resources for the other. Motivated by our analysis showing that there is a recursive relationship between word and document difficulty, we propose to jointly estimate word and document difficulty through a graph convolutional network (GCN) in a semi-supervised fashion. Our experimental results reveal that the GCN-based method can achieve higher accuracy than strong baselines, and stays robust even with a smaller amount of labeled data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Digital Accessibility for Disabilities
