TinyTL: Reduce Activations, Not Trainable Parameters for Efficient   On-Device Learning

Han Cai; Chuang Gan; Ligeng Zhu; Song Han

arXiv:2007.11622·cs.CV·June 8, 2021·33 cites

TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning

Han Cai, Chuang Gan, Ligeng Zhu, Song Han

PDF

Open Access 1 Repo

TL;DR

TinyTL introduces a memory-efficient on-device learning method that freezes weights and learns only bias modules, significantly reducing memory usage while maintaining high accuracy.

Contribution

The paper proposes TinyTL, a novel approach that reduces memory consumption by freezing weights and learning lightweight bias modules, enabling efficient on-device adaptation.

Findings

01

Up to 6.5x memory savings with minimal accuracy loss

02

Significant accuracy improvements over last-layer fine-tuning

03

7.3-12.9x memory reduction with maintained accuracy

Abstract

On-device learning enables edge devices to continually adapt the AI models to new data, which requires a small memory footprint to fit the tight memory constraint of edge devices. Existing work solves this problem by reducing the number of trainable parameters. However, this doesn't directly translate to memory saving since the major bottleneck is the activations, not parameters. In this work, we present Tiny-Transfer-Learning (TinyTL) for memory-efficient on-device learning. TinyTL freezes the weights while only learns the bias modules, thus no need to store the intermediate activations. To maintain the adaptation capacity, we introduce a new memory-efficient bias module, the lite residual module, to refine the feature extractor by learning small residual feature maps adding only 3.8% memory overhead. Extensive experiments show that TinyTL significantly saves the memory (up to 6.5x)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mit-han-lab/tinyML
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · IoT and Edge/Fog Computing