PAE MobiLLM: Privacy-Aware and Efficient LLM Fine-Tuning on the Mobile Device via Additive Side-Tuning

Xingke Yang; Liang Li; Zhiyi Wan; Sicong Li; Xiaoqi Qi; Jiang Liu; Tomoaki Ohtsuki; Xin Fu; Miao Pan

arXiv:2507.01216·cs.LG·August 12, 2025

PAE MobiLLM: Privacy-Aware and Efficient LLM Fine-Tuning on the Mobile Device via Additive Side-Tuning

Xingke Yang, Liang Li, Zhiyi Wan, Sicong Li, Xiaoqi Qi, Jiang Liu, Tomoaki Ohtsuki, Xin Fu, Miao Pan

PDF

Open Access

TL;DR

PAE MobiLLM enables privacy-aware, efficient on-device LLM fine-tuning by leveraging additive side-tuning, activation caching, and communication reduction techniques, making it suitable for mobile devices with limited resources.

Contribution

It introduces a novel privacy-preserving, server-assisted additive side-tuning method with activation caching and shortcut techniques for efficient mobile LLM fine-tuning.

Findings

01

Outperforms existing methods in efficiency and privacy preservation.

02

Reduces communication costs significantly.

03

Achieves competitive fine-tuning accuracy on mobile devices.

Abstract

There is a huge gap between numerous intriguing applications fostered by on-device large language model (LLM) fine-tuning (FT) from fresh mobile data and the limited resources of a mobile device. While existing server-assisted methods (e.g., split learning or side-tuning) may enable LLM FT on the local mobile device, they suffer from heavy communication burdens of activation transmissions, and may disclose data and labels to the server. To address those issues, we develop PAE MobiLLM, a a privacy-aware and efficient LLM FT method which can be deployed on the mobile device via server-assisted additive side-tuning. To further accelerate FT convergence and improve computing efficiency, PAE MobiLLM integrates activation caching on the server side, which allows the server to reuse historical activations and saves the mobile device from repeatedly computing forward passes for the recurring…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCaching and Content Delivery · Big Data and Digital Economy · IoT and Edge/Fog Computing