Curriculum-Based Self-Training Makes Better Few-Shot Learners for   Data-to-Text Generation

Pei Ke; Haozhe Ji; Zhenyu Yang; Yi Huang; Junlan Feng; Xiaoyan Zhu,; Minlie Huang

arXiv:2206.02712·cs.CL·June 7, 2022

Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation

Pei Ke, Haozhe Ji, Zhenyu Yang, Yi Huang, Junlan Feng, Xiaoyan Zhu,, Minlie Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a curriculum-based self-training method that improves few-shot data-to-text generation by effectively leveraging unlabeled data and modeling the relationship between structured data and text.

Contribution

The paper proposes Curriculum-Based Self-Training (CBST), a novel approach that enhances few-shot learning in data-to-text generation by ordering unlabeled data based on difficulty.

Findings

01

CBST outperforms fine-tuning and task-adaptive pre-training methods.

02

Achieves state-of-the-art results in few-shot data-to-text generation.

03

Effectively leverages unlabeled data through curriculum learning.

Abstract

Despite the success of text-to-text pre-trained models in various natural language generation (NLG) tasks, the generation performance is largely restricted by the number of labeled data in downstream tasks, particularly in data-to-text generation tasks. Existing works mostly utilize abundant unlabeled structured data to conduct unsupervised pre-training for task adaption, which fail to model the complex relationship between source structured data and target texts. Thus, we introduce self-training as a better few-shot learner than task-adaptive pre-training, which explicitly captures this relationship via pseudo-labeled data generated by the pre-trained model. To alleviate the side-effect of low-quality pseudo-labeled data during self-training, we propose a novel method called Curriculum-Based Self-Training (CBST) to effectively leverage unlabeled data in a rearranged order determined by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kepei1106/cbst
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis