New Evaluation Paradigm for Lexical Simplification
Jipeng Qiang, Minjiang Huang, Yi Zhu, Yunhao Yuan, Chaowei Zhang,, Xiaoye Ouyang

TL;DR
This paper introduces a new evaluation paradigm for lexical simplification that leverages large language models and human-machine collaboration, resulting in improved performance over traditional methods.
Contribution
It proposes a novel all-in-one dataset construction method and a multi-LLMs collaboration approach for more effective lexical simplification evaluation.
Findings
Multi-LLMs approach outperforms existing baselines
New dataset construction method improves evaluation accuracy
LLMs can simplify sentences directly with a single prompt
Abstract
Lexical Simplification (LS) methods use a three-step pipeline: complex word identification, substitute generation, and substitute ranking, each with separate evaluation datasets. We found large language models (LLMs) can simplify sentences directly with a single prompt, bypassing the traditional pipeline. However, existing LS datasets are not suitable for evaluating these LLM-generated simplified sentences, as they focus on providing substitutes for single complex words without identifying all complex words in a sentence. To address this gap, we propose a new annotation method for constructing an all-in-one LS dataset through human-machine collaboration. Automated methods generate a pool of potential substitutes, which human annotators then assess, suggesting additional alternatives as needed. Additionally, we explore LLM-based methods with single prompts, in-context learning, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Natural Language Processing Techniques · linguistics and terminology studies
MethodsFocus
