On the Relationship between Skill Neurons and Robustness in Prompt   Tuning

Leon Ackermann; Xenia Ohmer

arXiv:2309.12263·cs.CL·March 26, 2024

On the Relationship between Skill Neurons and Robustness in Prompt Tuning

Leon Ackermann, Xenia Ohmer

PDF

Open Access 1 Repo

TL;DR

This paper investigates the relationship between skill neurons and the robustness of prompt tuning in large language models, finding that models with more consistent skill neuron activation tend to be more adversarially robust.

Contribution

It demonstrates the existence of skill neurons in T5, compares their robustness to RoBERTa, and links consistent skill neuron activation to adversarial robustness.

Findings

01

Prompt tuning prompts are transferable within task types.

02

Prompts for RoBERTa are less robust to adversarial data.

03

Prompts for T5 show slightly better robustness and skill neuron consistency.

Abstract

Prompt Tuning is a popular parameter-efficient finetuning method for pre-trained large language models (PLMs). Based on experiments with RoBERTa, it has been suggested that Prompt Tuning activates specific neurons in the transformer's feed-forward networks, that are highly predictive and selective for the given task. In this paper, we study the robustness of Prompt Tuning in relation to these "skill neurons", using RoBERTa and T5. We show that prompts tuned for a specific task are transferable to tasks of the same type but are not very robust to adversarial data. While prompts tuned for RoBERTa yield below-chance performance on adversarial data, prompts tuned for T5 are slightly more robust and retain above-chance performance in two out of three cases. At the same time, we replicate the finding that skill neurons exist in RoBERTa and further show that skill neurons also exist in T5.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leonackermann/robust-neurons
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Motor Control and Adaptation · EEG and Brain-Computer Interfaces

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · WordPiece · Linear Warmup With Linear Decay · Adam · BERT · Linear Layer · Layer Normalization · Multi-Head Attention