InstantFT: An FPGA-Based Runtime Subsecond Fine-tuning of CNN Models
Keisuke Sugiura, Hiroki Matsutani

TL;DR
InstantFT is an FPGA-based method enabling ultra-fast, energy-efficient CNN fine-tuning on IoT devices, achieving subsecond adaptation times and comparable accuracy to existing approaches.
Contribution
The paper introduces InstantFT, a novel FPGA-based approach that significantly accelerates CNN fine-tuning for resource-limited IoT platforms using optimized PEFT techniques.
Findings
Fine-tunes CNN 17.4x faster than LoRA-based methods
Reduces fine-tuning time to 0.36 seconds
Improves energy efficiency by 16.3x
Abstract
Training deep neural networks (DNNs) requires significantly more computation and memory than inference, making runtime adaptation of DNNs challenging on resource-limited IoT platforms. We propose InstantFT, an FPGA-based method for ultra-fast CNN fine-tuning on IoT devices, by optimizing the forward and backward computations in parameter-efficient fine-tuning (PEFT). Experiments on datasets with concept drift demonstrate that InstantFT fine-tunes a pre-trained CNN 17.4x faster than existing Low-Rank Adaptation (LoRA)-based approaches, while achieving comparable accuracy. Our FPGA-based InstantFT reduces the fine-tuning time to just 0.36s and improves energy-efficiency by 16.3x, enabling on-the-fly adaptation of CNNs to non-stationary data distributions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Data Stream Mining Techniques · Adversarial Robustness in Machine Learning
