Lexical Simplification with Pretrained Encoders
Jipeng Qiang, Yun Li, Yi Zhu, Yunhao Yuan, Xindong Wu

TL;DR
This paper introduces an unsupervised lexical simplification method using BERT that considers sentence context, significantly improving candidate quality and outperforming existing approaches by over 12 accuracy points.
Contribution
The paper proposes a novel BERT-based unsupervised approach for lexical simplification that incorporates sentence context during candidate generation.
Findings
Outperforms state-of-the-art by more than 12 accuracy points
Uses BERT to generate context-aware simplification candidates
Achieves significant improvements on three benchmark datasets
Abstract
Lexical simplification (LS) aims to replace complex words in a given sentence with their simpler alternatives of equivalent meaning. Recently unsupervised lexical simplification approaches only rely on the complex word itself regardless of the given sentence to generate candidate substitutions, which will inevitably produce a large number of spurious candidates. We present a simple LS approach that makes use of the Bidirectional Encoder Representations from Transformers (BERT) which can consider both the given sentence and the complex word during generating candidate substitutions for the complex word. Specifically, we mask the complex word of the original sentence for feeding into the BERT to predict the masked token. The predicted results will be used as candidate substitutions. Despite being entirely unsupervised, experimental results show that our approach obtains obvious…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Natural Language Processing Techniques · Topic Modeling
MethodsLinear Layer · Weight Decay · Residual Connection · Adam · Layer Normalization · Softmax · Attention Is All You Need · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention
