Teaching the Pre-trained Model to Generate Simple Texts for Text   Simplification

Renliang Sun; Wei Xu; Xiaojun Wan

arXiv:2305.12463·cs.CL·May 23, 2023·2 cites

Teaching the Pre-trained Model to Generate Simple Texts for Text Simplification

Renliang Sun, Wei Xu, Xiaojun Wan

PDF

Open Access 1 Repo

TL;DR

This paper introduces SimpleBART, a continued pre-training approach that enhances a pre-trained model's ability to generate simple texts, significantly improving performance across various text simplification tasks.

Contribution

The paper proposes a novel continued pre-training strategy to teach pre-trained models to generate simple texts, improving their effectiveness in text simplification tasks.

Findings

01

SimpleBART outperforms BART on multiple simplification tasks

02

Continued pre-training improves model's ability to generate simple texts

03

SimpleBART compares favorably with large language models

Abstract

Randomly masking text spans in ordinary texts in the pre-training stage hardly allows models to acquire the ability to generate simple texts. It can hurt the performance of pre-trained models on text simplification tasks. In this paper, we propose a new continued pre-training strategy to teach the pre-trained model to generate simple texts. We continue pre-training BART, a representative model, to obtain SimpleBART. It consistently and significantly improves the results on lexical simplification, sentence simplification, and document-level simplification tasks over BART. At the end, we compare SimpleBART with several representative large language models (LLMs).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rlsnlp/simplebart
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Adam · Dense Connections · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding