An empirical study of task and feature correlations in the reuse of pre-trained models
Jama Hussein Mohamud, Willie Brink

TL;DR
This study investigates how the success of reusing pre-trained neural networks depends on task and feature correlations, revealing that correlations influence performance and optimal reuse strategies.
Contribution
It introduces an experimental framework to analyze factors affecting transfer success and demonstrates the impact of task and feature correlations on model reuse outcomes.
Findings
Performance increases with task correlation.
Reusing lower layers is better for uncorrelated tasks.
Semantic similarity enhances reuse effectiveness.
Abstract
Pre-trained neural networks are commonly used and reused in the machine learning community. Alice trains a model for a particular task, and a part of her neural network is reused by Bob for a different task, often to great effect. To what can we ascribe Bob's success? This paper introduces an experimental setup through which factors contributing to Bob's empirical success could be studied in silico. As a result, we demonstrate that Bob might just be lucky: his task accuracy increases monotonically with the correlation between his task and Alice's. Even when Bob has provably uncorrelated tasks and input features from Alice's pre-trained network, he can achieve significantly better than random performance due to Alice's choice of network and optimizer. When there is little correlation between tasks, only reusing lower pre-trained layers is preferable, and we hypothesize the converse: that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
