A Semi-Supervised Approach for Low-Resourced Text Generation

Hongyu Zang; Xiaojun Wan

arXiv:1906.00584·cs.CL·June 4, 2019·1 cites

A Semi-Supervised Approach for Low-Resourced Text Generation

Hongyu Zang, Xiaojun Wan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a semi-supervised method utilizing unlabeled data with denoising auto-encoders and reinforcement learning to improve low-resource text generation tasks, demonstrating significant performance gains.

Contribution

It presents a novel semi-supervised approach combining DAE and RL to leverage unlabeled data for low-resource text generation, which is adaptable across various tasks.

Findings

01

Significant performance improvements over basic models

02

Effective use of unlabeled data in low-resource settings

03

Adaptability across different text generation tasks

Abstract

Recently, encoder-decoder neural models have achieved great success on text generation tasks. However, one problem of this kind of models is that their performances are usually limited by the scale of well-labeled data, which are very expensive to get. The low-resource (of labeled data) problem is quite common in different task generation tasks, but unlabeled data are usually abundant. In this paper, we propose a method to make use of the unlabeled data to improve the performance of such models in the low-resourced circumstances. We use denoising auto-encoder (DAE) and language model (LM) based reinforcement learning (RL) to enhance the training of encoder and decoder with unlabeled data. Our method shows adaptability for different text generation tasks, and makes significant improvements over basic text generation models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhyack/UDRG
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification