From Tarzan to Tolkien: Controlling the Language Proficiency Level of   LLMs for Content Generation

Ali Malik; Stephen Mayhew; Chris Piech; Klinton Bicknell

arXiv:2406.03030·cs.CL·June 6, 2024·2 cites

From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation

Ali Malik, Stephen Mayhew, Chris Piech, Klinton Bicknell

PDF

Open Access 1 Models

TL;DR

This paper explores methods to control the difficulty level of text generated by large language models, aiming to make content suitable for language learners, and introduces a new model that outperforms existing solutions at lower cost.

Contribution

The paper presents CALM, a novel approach combining finetuning and reinforcement learning to effectively control language proficiency levels in LLM-generated content, surpassing GPT-4 performance.

Findings

01

Prompt-based strategies show large performance gaps between GPT-4 and open source models.

02

Finetuning and RL can bridge the performance gap effectively.

03

CALM outperforms GPT-4 and other methods in controlling language difficulty.

Abstract

We study the problem of controlling the difficulty level of text generated by Large Language Models (LLMs) for contexts where end-users are not fully proficient, such as language learners. Using a novel framework, we evaluate the effectiveness of several key approaches for this task, including few-shot prompting, supervised finetuning, and reinforcement learning (RL), utilising both GPT-4 and open source alternatives like LLama2-7B and Mistral-7B. Our findings reveal a large performance gap between GPT-4 and the open source models when using prompt-based strategies. However, we show how to bridge this gap with a careful combination of finetuning and RL alignment. Our best model, CALM (CEFR-Aligned Language Model), surpasses the performance of GPT-4 and other strategies, at only a fraction of the cost. We further validate the quality of our results through a small-scale human study.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
malikali/CEFR-Aligned-LM
model· 4 dl
4 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing

MethodsAttention Is All You Need · Softmax · Layer Normalization · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention