RingAda: Pipelining Large Model Fine-Tuning on Edge Devices with   Scheduled Layer Unfreezing

Liang Li; Xiaopei Chen; Wen Wu

arXiv:2502.19864·cs.DC·February 28, 2025

RingAda: Pipelining Large Model Fine-Tuning on Edge Devices with Scheduled Layer Unfreezing

Liang Li, Xiaopei Chen, Wen Wu

PDF

Open Access

TL;DR

RingAda is a novel framework enabling efficient, privacy-preserving fine-tuning of large transformer models on edge devices through scheduled layer unfreezing and collaborative pipeline training.

Contribution

It introduces a ring topology and pipeline mechanism for parameter-efficient, accelerated fine-tuning of large models on edge devices.

Findings

01

Reduces fine-tuning time significantly

02

Lowers memory costs during training

03

Maintains competitive model performance

Abstract

To enable large model (LM) based edge intelligent service provisioning, on-device fine-tuning with locally personalized data allows for continuous and privacy-preserving LM customization. In this paper, we propose RingAda, a collaborative training framework designed for fine-tuning transformer-based LMs on edge devices. Particularly, RingAda performs parameter-efficient adapter fine-tuning across a set of interconnected edge devices, forming a ring topology for per-batch training by sequentially placing frozen transformer blocks and their trainable adapter modules on the devices. RingAda follows a novel pipeline-parallel training mechanism with top-down adapter unfreezing, allowing for early-stopping of backpropagation at the lowest unfrozen adapter layer, thereby accelerating the fine-tuning process. Extensive experimental results demonstrate that RingAda significantly reduces…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices · IoT and Edge/Fog Computing