Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in   Financial Sentiment Analysis

Sudhandar Balakrishnan; Yihao Fang; Xioadan Zhu

arXiv:2211.05584·cs.CL·November 11, 2022

Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in Financial Sentiment Analysis

Sudhandar Balakrishnan, Yihao Fang, Xioadan Zhu

PDF

Open Access

TL;DR

This paper investigates the robustness of prefix tuning versus fine-tuning in transformer models for financial sentiment analysis, revealing that prefix tuning is less robust to noisy data and exhibits higher variance in performance.

Contribution

The study provides the first systematic analysis of prefix tuning's robustness to noise in financial NLP tasks, highlighting its limitations compared to fine-tuning.

Findings

01

Fine-tuning is more robust to noise than prefix tuning.

02

Prefix tuning shows high variance in F1 scores across different corruptions.

03

Performance of prefix tuning significantly decreases with increasing noise levels.

Abstract

The invention of transformer-based models such as BERT, GPT, and RoBERTa has enabled researchers and financial companies to finetune these powerful models and use them in different downstream tasks to achieve state-of-the-art performance. Recently, a lightweight alternative (approximately 0.1% - 3% of the original model parameters) to fine-tuning, known as prefix tuning has been introduced. This method freezes the model parameters and only updates the prefix to achieve performance comparable to full fine-tuning. Prefix tuning enables researchers and financial practitioners to achieve similar results with much fewer parameters. In this paper, we explore the robustness of prefix tuning when facing noisy data. Our experiments demonstrate that fine-tuning is more robust to noise than prefix tuning -- the latter method faces a significant decrease in performance on most corrupted data sets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStock Market Forecasting Methods · Imbalanced Data Classification Techniques · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Adam · Layer Normalization · Cosine Annealing · Byte Pair Encoding · Residual Connection · Dropout · Linear Warmup With Cosine Annealing · WordPiece