Prompt-based Learning for Text Readability Assessment

Bruce W. Lee; Jason Hyung-Jong Lee

arXiv:2302.13139·cs.CL·June 18, 2024·1 cites

Prompt-based Learning for Text Readability Assessment

Bruce W. Lee, Jason Hyung-Jong Lee

PDF

Open Access 1 Repo

TL;DR

This paper explores adapting pre-trained seq2seq models like T5 and BART for text readability assessment, demonstrating high accuracy and offering insights into prompt design and training strategies for improved cross-domain performance.

Contribution

It introduces a novel prompt-based, pairwise ranking approach using seq2seq models for readability assessment, enhancing cross-domain generalization and providing practical training tips.

Findings

01

Achieved 99.6% accuracy on Newsela dataset.

02

Identified prompt formats significantly impact model performance.

03

Demonstrated effective use of multiple data sources for training.

Abstract

We propose the novel adaptation of a pre-trained seq2seq model for readability assessment. We prove that a seq2seq model - T5 or BART - can be adapted to discern which text is more difficult from two given texts (pairwise). As an exploratory study to prompt-learn a neural network for text readability in a text-to-text manner, we report useful tips for future work in seq2seq training and ranking-based approach to readability assessment. Specifically, we test nine input-output formats/prefixes and show that they can significantly influence the final model performance. Also, we argue that the combination of text-to-text training and pairwise ranking setup 1) enables leveraging multiple parallel text simplification data for teaching readability and 2) trains a neural model for the general concept of readability (therefore, better cross-domain generalization). At last, we report a 99.6%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

brucewlee/prompt-learning-readability
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Tanh Activation · Adafactor · Gated Linear Unit · Softmax