Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in Large Language Models

Yuqiao Tan; Shizhu He; Kang Liu; Jun Zhao

arXiv:2505.14436·cs.CL·May 21, 2025

Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in Large Language Models

Yuqiao Tan, Shizhu He, Kang Liu, Jun Zhao

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the fundamental challenges of transferring knowledge across large language models of different scales, revealing neural incompatibility as a key obstacle and proposing new alignment paradigms to address it.

Contribution

It introduces the concepts of Post-Align PKT and Pre-Align PKT, along with the LaTen method, to improve cross-scale parametric knowledge transfer in LLMs.

Findings

01

Alignment in parametric space is essential for successful cross-scale PKT.

02

Neural incompatibility stems from structural differences between models of different scales.

03

Proposed methods face challenges in achieving stable transfer, highlighting fundamental limitations.

Abstract

Large Language Models (LLMs) offer a transparent brain with accessible parameters that encode extensive knowledge, which can be analyzed, located and transferred. Consequently, a key research challenge is to transcend traditional knowledge transfer paradigms rooted in symbolic language and achieve genuine Parametric Knowledge Transfer (PKT). Significantly, exploring effective methods for transferring knowledge across LLMs of different scales through parameters presents an intriguing and valuable research direction. In this paper, we first demonstrate $Alignment$ in parametric space is the fundamental prerequisite to achieve successful cross-scale PKT. We redefine the previously explored knowledge transfer as Post-Align PKT (PostPKT), which utilizes extracted parameters for LoRA initialization and requires subsequent fine-tune for alignment. Hence, to reduce cost for further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

trae1oung/neural_incompatibility
pytorchOfficial

Videos

Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques