Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in Financial Sentiment Analysis
Sudhandar Balakrishnan, Yihao Fang, Xioadan Zhu

TL;DR
This paper investigates the robustness of prefix tuning versus fine-tuning in transformer models for financial sentiment analysis, revealing that prefix tuning is less robust to noisy data and exhibits higher variance in performance.
Contribution
The study provides the first systematic analysis of prefix tuning's robustness to noise in financial NLP tasks, highlighting its limitations compared to fine-tuning.
Findings
Fine-tuning is more robust to noise than prefix tuning.
Prefix tuning shows high variance in F1 scores across different corruptions.
Performance of prefix tuning significantly decreases with increasing noise levels.
Abstract
The invention of transformer-based models such as BERT, GPT, and RoBERTa has enabled researchers and financial companies to finetune these powerful models and use them in different downstream tasks to achieve state-of-the-art performance. Recently, a lightweight alternative (approximately 0.1% - 3% of the original model parameters) to fine-tuning, known as prefix tuning has been introduced. This method freezes the model parameters and only updates the prefix to achieve performance comparable to full fine-tuning. Prefix tuning enables researchers and financial practitioners to achieve similar results with much fewer parameters. In this paper, we explore the robustness of prefix tuning when facing noisy data. Our experiments demonstrate that fine-tuning is more robust to noise than prefix tuning -- the latter method faces a significant decrease in performance on most corrupted data sets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Imbalanced Data Classification Techniques · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Adam · Layer Normalization · Cosine Annealing · Byte Pair Encoding · Residual Connection · Dropout · Linear Warmup With Cosine Annealing · WordPiece
