MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and   Modalities

Kunxi Li; Tianyu Zhan; Kairui Fu; Shengyu Zhang; Kun Kuang; Jiwei Li,; Zhou Zhao; Fan Wu; Fei Wu

arXiv:2404.13322·cs.LG·December 30, 2024

MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities

Kunxi Li, Tianyu Zhan, Kairui Fu, Shengyu Zhang, Kun Kuang, Jiwei Li,, Zhou Zhao, Fan Wu, Fei Wu

PDF

Open Access 1 Video

TL;DR

MergeNet introduces a novel framework for transferring knowledge across diverse models, tasks, and modalities by learning to map parameter spaces, enabling effective knowledge transfer where traditional methods struggle.

Contribution

The paper proposes MergeNet, a method that bridges heterogeneous model parameter spaces for knowledge transfer, overcoming limitations of existing approaches that rely on shared structures.

Findings

01

Significant improvements in heterogeneous knowledge transfer tasks.

02

Effective transfer across different model architectures, tasks, and modalities.

03

Outperforms existing methods in challenging transfer scenarios.

Abstract

In this study, we focus on heterogeneous knowledge transfer across entirely different model architectures, tasks, and modalities. Existing knowledge transfer methods (e.g., backbone sharing, knowledge distillation) often hinge on shared elements within model structures or task-specific features/labels, limiting transfers to complex model types or tasks. To overcome these challenges, we present MergeNet, which learns to bridge the gap of parameter spaces of heterogeneous models, facilitating the direct interaction, extraction, and application of knowledge within these parameter spaces. The core mechanism of MergeNet lies in the parameter adapter, which operates by querying the source model's low-rank parameters and adeptly learning to identify and map parameters into the target model. MergeNet is learned alongside both models, allowing our framework to dynamically transfer and adapt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities· underline

Taxonomy

TopicsCollaboration in agile enterprises

MethodsFocus