DAF: An Efficient End-to-End Dynamic Activation Framework for on-Device DNN Training
Renyuan Liu, Yuyang Leng, Kaiyan Liu, Shaohan Hu, Chun-Fu (Richard) Chen, Peijun Zhao, Heechul Yun, Shuochao Yao

TL;DR
DAF is a system that significantly reduces memory usage and speeds up on-device deep neural network training by employing system-level optimizations like hybrid reduction, collaborative bit-packing, and importance-aware paging.
Contribution
The paper introduces DAF, a novel framework that enables efficient on-device DNN training through system-level optimizations for activation compression and memory management.
Findings
Up to 22.9x memory reduction during training.
Achieves 3.2x speedup on mobile platforms.
Maintains training accuracy while reducing resource consumption.
Abstract
Recent advancements in on-device training for deep neural networks have underscored the critical need for efficient activation compression to overcome the memory constraints of mobile and edge devices. As activations dominate memory usage during training and are essential for gradient computation, compressing them without compromising accuracy remains a key research challenge. While existing methods for dynamic activation quantization promise theoretical memory savings, their practical deployment is impeded by system-level challenges such as computational overhead and memory fragmentation. To address these challenges, we introduce DAF, a Dynamic Activation Framework that enables scalable and efficient on-device training through system-level optimizations. DAF achieves both memory- and time-efficient dynamic quantization training by addressing key system bottlenecks. It develops hybrid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Big Data and Digital Economy
