Does the Order of Fine-tuning Matter and Why?

Qihong Chen; Jiawei Li; Hyunjae Suh; Lianghao Jiang; Zheng Zhou,; Jingze Chen; Jiri Gesi; Iftekhar Ahmed

arXiv:2410.02915·cs.SE·October 7, 2024

Does the Order of Fine-tuning Matter and Why?

Qihong Chen, Jiawei Li, Hyunjae Suh, Lianghao Jiang, Zheng Zhou,, Jingze Chen, Jiri Gesi, Iftekhar Ahmed

PDF

Open Access

TL;DR

This paper empirically investigates how the order of multiple intermediate fine-tuning tasks affects the performance of language models on target tasks in Software Engineering, revealing performance variations up to 6%.

Contribution

It is the first study to analyze the impact of multiple task orderings on fine-tuning in Software Engineering, providing insights into optimal task sequences.

Findings

01

Task ordering impacts target task performance by up to 6%.

02

Dataset and task characteristics influence the effect of task order.

03

Insights help select cost-effective fine-tuning sequences.

Abstract

To improve the performance on a target task, researchers have fine-tuned language models with an intermediate task before the target task of interest. However, previous works have focused on the pre-trained language models and downstream tasks in Natural Language Processing (NLP) and considered only one intermediate task. The effect of fine-tuning multiple intermediate tasks and their ordering on target task performance has not been fully explored in Software Engineering. In this study, we perform the first empirical study on analyzing the impact of task ordering on target task performance. Experimental results show that there is an impact of task ordering on target task performance by up to 6% of performance gain and up to 4% of performance loss. To explain such an impact, we consider a variety of potential factors, including the characteristics of dataset (syntactic similarity and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware System Performance and Reliability · Software Engineering Research · Mobile Crowdsensing and Crowdsourcing