Black-box Prompt Learning for Pre-trained Language Models
Shizhe Diao, Zhichao Huang, Ruijia Xu, Xuechun Li, Yong Lin, Xiao, Zhou, Tong Zhang

TL;DR
This paper introduces a black-box prompt learning method for pre-trained language models that enables efficient task adaptation without model access, using a variance-reduced policy gradient approach to optimize discrete prompts in a cloud-device setting.
Contribution
We propose a novel black-box prompt learning framework that optimizes discrete prompts via policy gradients, enhancing privacy and security while maintaining high performance.
Findings
Achieves significant improvements on eight benchmarks.
Effectively tunes prompts with limited API calls.
Demonstrates transferability and interpretability of learned prompts.
Abstract
The increasing scale of general-purpose Pre-trained Language Models (PLMs) necessitates the study of more efficient adaptation across different downstream tasks. In this paper, we establish a Black-box Discrete Prompt Learning (BDPL) to resonate with pragmatic interactions between the cloud infrastructure and edge devices. Particularly, instead of fine-tuning the model in the cloud, we adapt PLMs by prompt learning, which efficiently optimizes only a few parameters of the discrete prompts. Moreover, we consider the scenario that we do not have access to the parameters and gradients of the pre-trained models, except for its outputs given inputs. This black-box setting secures the cloud infrastructure from potential attack and misuse to cause a single-point failure, which is preferable to the white-box counterpart by current infrastructures. Under this black-box constraint, we apply a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · WordPiece · Cosine Annealing · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · BERT · Weight Decay
