A Unified Framework for Knowledge Transfer in Bidirectional Model Scaling
Jianlu Shen, Fu Feng, Jiaze Xu, Yucheng Xie, Jiaqi Lv, Xin Geng

TL;DR
This paper introduces BoT, a unified, size-agnostic framework for bidirectional knowledge transfer between models of different sizes, using wavelet transforms to efficiently scale models up or down.
Contribution
It presents the first unified framework that treats size scaling as signal processing, enabling efficient bidirectional transfer with a parameter-free approach.
Findings
Significant FLOPs savings up to 67.1% for S2L and 52.8% for L2S.
Achieves state-of-the-art results on GLUE and SQuAD benchmarks.
Applicable to models like DeiT, BERT, and GPT.
Abstract
Transferring pre-trained knowledge from a source model to a target model of a different architectural size is a key challenge for flexible and efficient model scaling. However, current parameter-space methods treat Small-to-Large (S2L) and Large-to-Small (L2S) scaling as separate, incompatible problems, focusing on parameter synthesis and selection, respectively. This fragmented perspective has resulted in specialized tools, hindering a unified, bidirectional framework. In this paper, we propose BoT (Bidirectional knowledge Transfer), the first size-agnostic framework to unify S2L and L2S scaling. Our core insight is to treat model weights as continuous signals, where models of different sizes represent distinct discretizations of the transferable knowledge. This multi-resolution perspective directly casts S2L and L2S scaling as the signal processing operations of upsampling and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks
