Black-box Prompt Learning for Pre-trained Language Models

Shizhe Diao; Zhichao Huang; Ruijia Xu; Xuechun Li; Yong Lin; Xiao; Zhou; Tong Zhang

arXiv:2201.08531·cs.CL·February 24, 2023·30 cites

Black-box Prompt Learning for Pre-trained Language Models

Shizhe Diao, Zhichao Huang, Ruijia Xu, Xuechun Li, Yong Lin, Xiao, Zhou, Tong Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a black-box prompt learning method for pre-trained language models that enables efficient task adaptation without model access, using a variance-reduced policy gradient approach to optimize discrete prompts in a cloud-device setting.

Contribution

We propose a novel black-box prompt learning framework that optimizes discrete prompts via policy gradients, enhancing privacy and security while maintaining high performance.

Findings

01

Achieves significant improvements on eight benchmarks.

02

Effectively tunes prompts with limited API calls.

03

Demonstrates transferability and interpretability of learned prompts.

Abstract

The increasing scale of general-purpose Pre-trained Language Models (PLMs) necessitates the study of more efficient adaptation across different downstream tasks. In this paper, we establish a Black-box Discrete Prompt Learning (BDPL) to resonate with pragmatic interactions between the cloud infrastructure and edge devices. Particularly, instead of fine-tuning the model in the cloud, we adapt PLMs by prompt learning, which efficiently optimizes only a few parameters of the discrete prompts. Moreover, we consider the scenario that we do not have access to the parameters and gradients of the pre-trained models, except for its outputs given inputs. This black-box setting secures the cloud infrastructure from potential attack and misuse to cause a single-point failure, which is preferable to the white-box counterpart by current infrastructures. Under this black-box constraint, we apply a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shizhediao/black-box-prompt-learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · WordPiece · Cosine Annealing · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · BERT · Weight Decay