Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation
Pei Ke, Haozhe Ji, Zhenyu Yang, Yi Huang, Junlan Feng, Xiaoyan Zhu,, Minlie Huang

TL;DR
This paper introduces a curriculum-based self-training method that improves few-shot data-to-text generation by effectively leveraging unlabeled data and modeling the relationship between structured data and text.
Contribution
The paper proposes Curriculum-Based Self-Training (CBST), a novel approach that enhances few-shot learning in data-to-text generation by ordering unlabeled data based on difficulty.
Findings
CBST outperforms fine-tuning and task-adaptive pre-training methods.
Achieves state-of-the-art results in few-shot data-to-text generation.
Effectively leverages unlabeled data through curriculum learning.
Abstract
Despite the success of text-to-text pre-trained models in various natural language generation (NLG) tasks, the generation performance is largely restricted by the number of labeled data in downstream tasks, particularly in data-to-text generation tasks. Existing works mostly utilize abundant unlabeled structured data to conduct unsupervised pre-training for task adaption, which fail to model the complex relationship between source structured data and target texts. Thus, we introduce self-training as a better few-shot learner than task-adaptive pre-training, which explicitly captures this relationship via pseudo-labeled data generated by the pre-trained model. To alleviate the side-effect of low-quality pseudo-labeled data during self-training, we propose a novel method called Curriculum-Based Self-Training (CBST) to effectively leverage unlabeled data in a rearranged order determined by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
