Scaled Prompt-Tuning for Few-Shot Natural Language Generation

Ting Hu; Christoph Meinel; Haojin Yang

arXiv:2309.06759·cs.CL·September 14, 2023

Scaled Prompt-Tuning for Few-Shot Natural Language Generation

Ting Hu, Christoph Meinel, Haojin Yang

PDF

Open Access

TL;DR

This paper introduces Scaled Prompt-Tuning (SPT), a parameter-efficient method for few-shot natural language generation that improves performance and transferability without significantly increasing training costs.

Contribution

The paper proposes SPT, a novel PEFT approach that outperforms traditional prompt-tuning and demonstrates superior transferability in few-shot NLG tasks.

Findings

01

SPT surpasses conventional prompt-tuning in performance and generalization.

02

Intermediate SPT shows strong transferability in data-limited scenarios.

03

Some existing PEFT methods like Prefix-Tuning struggle on challenging few-shot NLG datasets.

Abstract

The increasingly Large Language Models (LLMs) demonstrate stronger language understanding and generation capabilities, while the memory demand and computation cost of fine-tuning LLMs on downstream tasks are non-negligible. Besides, fine-tuning generally requires a certain amount of data from individual tasks whilst data collection cost is another issue to consider in real-world applications. In this work, we focus on Parameter-Efficient Fine-Tuning (PEFT) methods for few-shot Natural Language Generation (NLG), which freeze most parameters in LLMs and tune a small subset of parameters in few-shot cases so that memory footprint, training cost, and labeling cost are reduced while maintaining or even improving the performance. We propose a Scaled Prompt-Tuning (SPT) method which surpasses conventional PT with better performance and generalization ability but without an obvious increase in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsFocus