AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air
Shiyi Yang, Xiaoxue Yu, Rongpeng Li, Jianhang Zhu, Zhifeng Zhao, Honggang Zhang

TL;DR
AirLLM introduces a diffusion policy framework for adaptive LoRA configuration. It enables efficient remote fine-tuning of large language models over bandwidth-limited channels by dynamically trading off transmission and computational costs.
Contribution
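The paper presents a hierarchical diffusion policy that combines Proximal Policy Optimization (PPO) with Denoising Diffusion Implicit Models (DDIM) to adapt LoRA rank configurations during remote LLM fine-tuning, with the aim of improving both communication efficiency and fine-tuning performance.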
Findings
Enhanced fine-tuning accuracy under low SNR conditions
Significant reduction in transmission costs
Effective reinforcement-driven, diffusion-refined rank adaptation
Abstract
Operating Large Language Models (LLMs) on edge devices is increasingly challenged by limited communication bandwidth and strained computational and memory costs. Thus, cloud-assisted remote fine-tuning becomes indispensable. Nevertheless, existing Low-Rank Adaptation (LoRA) approaches typically employ fixed or heuristic rank configurations, and the subsequent over-the-air transmission of all LoRA parameters could be rather inefficient. To address this limitation, we develop AirLLM, a hierarchical diffusion policy framework for communication-aware LoRA adaptation. Specifically, AirLLM models the rank configuration as a structured action vector that spans all LoRA-inserted projections. To solve the underlying high-dimensional sequential decision-making problem, a Proximal Policy Optimization (PPO) agent generates coarse-grained decisions by jointly observing wireless states and linguistic…
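The link between a per-projection rank vector and over-the-air cost can be illustrated with a minimal Python sketch. It relies only on the standard LoRA parameter count (a rank-r adapter on a d_out x d_in weight adds r * (d_in + d_out) parameters). The projection names, the 768-wide dimensions, the example rank vectors, and the 16-bit parameter width are hypothetical values chosen for illustration, not settings taken from the paper.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Projection:
    """A weight matrix that receives a LoRA adapter (illustrative only)."""
    name: str
    d_in: int
    d_out: int


def lora_param_count(proj: Projection, rank: int) -> int:
    """LoRA adds A (rank x d_in) and B (d_out x rank): rank * (d_in + d_out) parameters."""
    if rank < 0:
        raise ValueError("rank must be non-negative")
    return rank * (proj.d_in + proj.d_out)


def transmission_payload(
    projections: Sequence[Projection],
    ranks: Sequence[int],
    bits_per_param: int = 16,  # assumed precision, not from the paper
) -> Tuple[int, int]:
    """Return (total LoRA parameters, total bits) for one rank vector."""
    if len(projections) != len(ranks):
        raise ValueError("need exactly one rank per projection")
    total = sum(lora_param_count(p, r) for p, r in zip(projections, ranks))
    return total, total * bits_per_param


# Hypothetical two-projection model with 768-dimensional square projections.
projections = [Projection("q_proj", 768, 768), Projection("v_proj", 768, 768)]

fixed_ranks = [8, 8]      # a fixed configuration
adaptive_ranks = [4, 2]   # a smaller, per-projection configuration

fixed_params, fixed_bits = transmission_payload(projections, fixed_ranks)
adaptive_params, adaptive_bits = transmission_payload(projections, adaptive_ranks)
```

Because the payload grows linearly in each rank, lowering the rank of individual projections shrinks the amount of data sent over the channel, which is the quantity a communication-aware rank policy would trade off against fine-tuning accuracy.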
Taxonomy
Topics: Advanced Wireless Communication Techniques · Power Line Communications and Noise · Cooperative Communication and Network Coding
Methods: Proximal Policy Optimization · Diffusion
