MobileTL: On-device Transfer Learning with Inverted Residual Blocks
Hung-Yueh Chiang, Natalia Frumkin, Feng Liang, Diana Marculescu

TL;DR
MobileTL enables efficient on-device transfer learning for models with Inverted Residual Blocks by reducing memory and computation costs, making transfer learning feasible on resource-limited edge devices.
Contribution
The paper introduces MobileTL, a novel method that allows transfer learning with IRB-based models on edge devices by approximating backward computations and fine-tuning top layers.
Findings
Reduces memory usage by 46% and 53% for MobileNetV2 and V3 IRBs.
Achieves 36% FLOPs reduction with minimal accuracy loss on CIFAR10.
Demonstrates Pareto-optimal performance across multiple datasets.
Abstract
Transfer learning on edge is challenging due to on-device limited resources. Existing work addresses this issue by training a subset of parameters or adding model patches. Developed with inference in mind, Inverted Residual Blocks (IRBs) split a convolutional layer into depthwise and pointwise convolutions, leading to more stacking layers, e.g., convolution, normalization, and activation layers. Though they are efficient for inference, IRBs require that additional activation maps are stored in memory for training weights for convolution layers and scales for normalization layers. As a result, their high memory cost prohibits training IRBs on resource-limited edge devices, and making them unsuitable in the context of transfer learning. To address this issue, we present MobileTL, a memory and computationally efficient on-device transfer learning method for models built with IRBs. MobileTL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Ferroelectric and Negative Capacitance Devices · Machine Learning and ELM
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Sigmoid Activation · Pointwise Convolution · Dense Connections · Depthwise Convolution · Squeeze-and-Excitation Block · ReLU6 · Depthwise Separable Convolution · Inverted Residual Block
