Selection-p: Self-Supervised Task-Agnostic Prompt Compression for   Faithfulness and Transferability

Tsz Ting Chung; Leyang Cui; Lemao Liu; Xinting Huang; Shuming Shi,; Dit-Yan Yeung

arXiv:2410.11786·cs.CL·October 22, 2024

Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability

Tsz Ting Chung, Leyang Cui, Lemao Liu, Xinting Huang, Shuming Shi,, Dit-Yan Yeung

PDF

Open Access 1 Models 1 Video

TL;DR

Selection-p is a self-supervised prompt compression method that effectively reduces prompt size, maintains high task performance, and transfers well across models, addressing key limitations of prior approaches.

Contribution

It introduces a unified, self-supervised prompt compression technique that discretizes tokens and improves transferability without external data or model-specific tuning.

Findings

01

Achieves up to 10x compression with only 0.8% performance loss.

02

Outperforms prior methods in transferability across models.

03

Maintains performance on long-context in-context learning.

Abstract

Large Language Models (LLMs) have demonstrated impressive capabilities in a wide range of natural language processing tasks when leveraging in-context learning. To mitigate the additional computational and financial costs associated with in-context learning, several prompt compression methods have been proposed to compress the in-context learning prompts. Despite their success, these methods face challenges with transferability due to model-specific compression, or rely on external training data, such as GPT-4. In this paper, we investigate the ability of LLMs to develop a unified compression method that discretizes uninformative tokens, utilizing a self-supervised pre-training technique. By introducing a small number of parameters during the continual pre-training, the proposed Selection-p produces a probability for each input token, indicating whether to preserve or discard it.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
ttchungc/selection-p
model· 18 dl· ♡ 1
18 dl♡ 1

Videos

Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability· underline

Taxonomy

TopicsEEG and Brain-Computer Interfaces

MethodsAttention Is All You Need · Dropout · Layer Normalization · Adam · Dense Connections · Residual Connection · Position-Wise Feed-Forward Layer · Linear Layer · Byte Pair Encoding · Absolute Position Encodings