A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification
Sorouralsadat Fatemi, Yuheng Hu, Maryam Mousavi

TL;DR
This paper evaluates instruction fine-tuning of smaller LLMs for financial text classification, demonstrating improved performance and robustness, especially when combined with model merging techniques for zero-shot tasks.
Contribution
It introduces a comprehensive comparison of instruction fine-tuning on smaller LLMs for financial NLP tasks and proposes model merging to enhance zero-shot capabilities.
Findings
Instruction fine-tuning improves financial text classification performance.
Model merging enhances zero-shot generalization beyond original models.
Instruction-tuned models are more robust to degradation on unseen tasks.
Abstract
Large Language Models (LLMs) have demonstrated impressive capabilities across diverse Natural Language Processing (NLP) tasks, including language understanding, reasoning, and generation. However, general-domain LLMs often struggle with financial tasks due to the technical and specialized nature of financial texts. This study investigates the efficacy of instruction fine-tuning smaller-scale LLMs, including Mistral-7B, Llama3-8B, and Phi3-mini, to enhance their performance in financial text classification tasks. We fine-tuned both instruction-tuned and base models across four financial classification tasks, achieving significant improvements in task-specific performance. Furthermore, we evaluated the zero-shot capabilities of these fine-tuned models on three unseen complex financial tasks, including argument classification, deal completeness classification, and causal classification.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Technology and Data Analysis · Advanced Decision-Making Techniques
MethodsBalanced Selection
