Beyond End-to-End: Dynamic Chain Optimization for Private LLM Adaptation on the Edge

Yebo Wu; Jingguang Li; Chunlin Tian; Kahou Tam; Zhijiang Guo; Li Li

arXiv:2604.06819·cs.DC·April 9, 2026

Beyond End-to-End: Dynamic Chain Optimization for Private LLM Adaptation on the Edge

Yebo Wu, Jingguang Li, Chunlin Tian, Kahou Tam, Zhijiang Guo, Li Li

PDF

TL;DR

This paper introduces ChainFed, a layer-by-layer federated fine-tuning method for private LLM adaptation on edge devices, overcoming memory limitations and improving accuracy significantly.

Contribution

It proposes a novel sequential fine-tuning paradigm with three key techniques, outperforming existing methods in privacy-preserving edge LLM adaptation.

Findings

01

ChainFed achieves up to 46.46% accuracy improvement.

02

The method effectively addresses memory constraints on edge devices.

03

Extensive experiments validate its superiority over existing approaches.

Abstract

Federated fine-tuning enables privacy-preserving LLM adaptation but faces a critical bottleneck: the disparity between LLMs' high memory demands and edge devices' limited capacity. To break the memory barrier, we propose Chain Federated Fine-Tuning (ChainFed), an innovative paradigm that forgoes end-to-end updates in favor of a sequential, layer-by-layer manner. It first trains the initial adapter to convergence, freezes its weights, and then proceeds to the next. This iterative train-and-freeze process forms an optimization chain, gradually enhancing the model's task-specific proficiency. ChainFed further integrates three core techniques: 1) Dynamic Layer Co-Tuning to bridge semantic gaps between sequentially tuned layers and facilitate information flow; 2) Globally Perceptive Optimization to endow each adapter with foresight beyond its local objective; 3) Function-Oriented Adaptive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.