Generating Educational Materials with Different Levels of Readability using LLMs
Chieh-Yang Huang, Jing Wei, Ting-Hao 'Kenneth' Huang

TL;DR
This paper explores the use of large language models to generate educational materials at different readability levels, assessing their effectiveness and challenges in maintaining content accuracy and appropriate difficulty.
Contribution
It introduces the leveled-text generation task and evaluates multiple LLMs' ability to produce content at specified readability levels with minimal supervision.
Findings
Few-shot prompting improves readability control and content preservation.
LLaMA-2 70B better achieves target difficulty levels.
GPT-3.5 better preserves original meaning.
Abstract
This study introduces the leveled-text generation task, aiming to rewrite educational materials to specific readability levels while preserving meaning. We assess the capability of GPT-3.5, LLaMA-2 70B, and Mixtral 8x7B, to generate content at various readability levels through zero-shot and few-shot prompting. Evaluating 100 processed educational materials reveals that few-shot prompting significantly improves performance in readability manipulation and information preservation. LLaMA-2 70B performs better in achieving the desired difficulty range, while GPT-3.5 maintains original meaning. However, manual inspection highlights concerns such as misinformation introduction and inconsistent edit distribution. These findings emphasize the need for further research to ensure the quality of generated educational content.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Cosine Annealing · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Adam · Attention Dropout · Weight Decay
