AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air

Shiyi Yang; Xiaoxue Yu; Rongpeng Li; Jianhang Zhu; Zhifeng Zhao; Honggang Zhang

arXiv:2507.11515·cs.LG·July 16, 2025

Open Access

TL;DR

AirLLM introduces a diffusion policy framework for adaptive LoRA rank configuration. It targets efficient remote fine-tuning of large language models over bandwidth-limited channels by dynamically trading off transmission cost against computational cost.

Contribution

The paper presents a hierarchical diffusion policy approach that combines PPO and DDIM to adaptively configure LoRA ranks for remote LLM fine-tuning, with the aim of lowering transmission cost while improving fine-tuning accuracy.

Findings

01. Enhanced fine-tuning accuracy under low SNR conditions

02. Significant reduction in transmission costs

03. Effective reinforcement-driven, diffusion-refined rank adaptation

Abstract

Operating Large Language Models (LLMs) on edge devices is increasingly challenged by limited communication bandwidth and strained computational and memory costs. Thus, cloud-assisted remote fine-tuning becomes indispensable. Nevertheless, existing Low-Rank Adaptation (LoRA) approaches typically employ fixed or heuristic rank configurations, and the subsequent over-the-air transmission of all LoRA parameters could be rather inefficient. To address this limitation, we develop AirLLM, a hierarchical diffusion policy framework for communication-aware LoRA adaptation. Specifically, AirLLM models the rank configuration as a structured action vector that spans all LoRA-inserted projections. To solve the underlying high-dimensional sequential decision-making problem, a Proximal Policy Optimization (PPO) agent generates coarse-grained decisions by jointly observing wireless states and linguistic…
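
The abstract describes the rank configuration as a vector with one entry per LoRA-inserted projection. As a rough illustration of why that vector determines over-the-air cost, the sketch below counts the LoRA parameters implied by a rank vector, using the standard LoRA factorization in which a rank-r adapter on a d_out x d_in weight adds r * (d_in + d_out) parameters. The projection names, layer sizes, rank values, and the 16-bit-per-parameter payload are illustrative assumptions, not values from the paper, and this is not the authors' cost model.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Projection:
    """One LoRA-inserted projection (hypothetical names and shapes)."""
    name: str
    d_in: int
    d_out: int


def lora_param_count(proj: Projection, rank: int) -> int:
    """Parameters in A (rank x d_in) plus B (d_out x rank)."""
    if rank < 0:
        raise ValueError("rank must be non-negative")
    return rank * (proj.d_in + proj.d_out)


def payload_bits(projections: Sequence[Projection],
                 ranks: Sequence[int],
                 bits_per_param: int = 16) -> int:
    """Bits needed to send every adapter, assuming a fixed precision."""
    if len(projections) != len(ranks):
        raise ValueError("need exactly one rank per projection")
    total = sum(lora_param_count(p, r) for p, r in zip(projections, ranks))
    return bits_per_param * total


# Assumed toy model: two 768 x 768 attention projections.
projections = [Projection("q_proj", 768, 768), Projection("v_proj", 768, 768)]

fixed = payload_bits(projections, [8, 8])     # same rank everywhere
adaptive = payload_bits(projections, [4, 2])  # per-projection ranks

print(f"fixed rank vector:    {fixed} bits")
print(f"adaptive rank vector: {adaptive} bits")
```

In this toy setting, lowering the per-projection ranks shrinks the payload in direct proportion to the summed ranks, which is the quantity an adaptive rank policy would have to balance against fine-tuning accuracy.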

Taxonomy

Topics: Advanced Wireless Communication Techniques · Power Line Communications and Noise · Cooperative Communication and Network Coding

Methods: Proximal Policy Optimization · Diffusion