InstantFT: An FPGA-Based Runtime Subsecond Fine-tuning of CNN Models

Keisuke Sugiura; Hiroki Matsutani

arXiv:2506.06505·cs.LG·June 10, 2025

TL;DR

InstantFT is an FPGA-based method for ultra-fast, energy-efficient CNN fine-tuning on IoT devices, achieving subsecond adaptation (0.36 s) with accuracy comparable to existing LoRA-based approaches.

Contribution

The paper introduces InstantFT, an FPGA-based approach that accelerates CNN fine-tuning on resource-limited IoT platforms by optimizing the forward and backward computations in parameter-efficient fine-tuning (PEFT).

Findings

01. Fine-tunes a pre-trained CNN 17.4x faster than existing LoRA-based approaches

02. Reduces fine-tuning time to 0.36 seconds

03. Improves energy efficiency by 16.3x

Abstract

Training deep neural networks (DNNs) requires significantly more computation and memory than inference, making runtime adaptation of DNNs challenging on resource-limited IoT platforms. We propose InstantFT, an FPGA-based method for ultra-fast CNN fine-tuning on IoT devices, by optimizing the forward and backward computations in parameter-efficient fine-tuning (PEFT). Experiments on datasets with concept drift demonstrate that InstantFT fine-tunes a pre-trained CNN 17.4x faster than existing Low-Rank Adaptation (LoRA)-based approaches, while achieving comparable accuracy. Our FPGA-based InstantFT reduces the fine-tuning time to just 0.36s and improves energy efficiency by 16.3x, enabling on-the-fly adaptation of CNNs to non-stationary data distributions.
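
For readers unfamiliar with the LoRA baseline named in the abstract, the following is a minimal sketch of a generic low-rank adapter on a single dense layer. It is not InstantFT's method or the authors' code: the function names (`matmul`, `lora_forward`, `lora_param_counts`, `make_example`), the dense-layer setting, and all sizes are assumptions chosen for illustration. The frozen weight `w` is left untouched, and only the small factors `a` and `b` would be trained.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def lora_forward(x, w, a, b, scale=1.0):
    """Generic LoRA forward pass: y = x W + scale * (x A) B.

    w is the frozen pre-trained weight (d_in x d_out);
    a (d_in x r) and b (r x d_out) are the trainable low-rank factors.
    """
    base = matmul(x, w)                 # frozen path
    update = matmul(matmul(x, a), b)    # low-rank adapter path
    return [[p + scale * q for p, q in zip(base_row, upd_row)]
            for base_row, upd_row in zip(base, update)]

def lora_param_counts(d_in, d_out, rank):
    """Return (parameters in the full weight, trainable adapter parameters)."""
    return d_in * d_out, rank * (d_in + d_out)

def make_example(d_in=8, d_out=6, rank=2, batch=3, seed=0):
    """Build hypothetical toy tensors; sizes are illustrative assumptions."""
    rng = random.Random(seed)

    def rand(rows, cols):
        return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

    x = rand(batch, d_in)
    w = rand(d_in, d_out)
    a = rand(d_in, rank)                        # random initialization
    b = [[0.0] * d_out for _ in range(rank)]    # zero init: adapter starts as a no-op
    return x, w, a, b

if __name__ == "__main__":
    x, w, a, b = make_example()
    print(lora_forward(x, w, a, b) == matmul(x, w))
    print(lora_param_counts(64, 64, 4))
```

Because `b` starts at zero, the adapted layer initially reproduces the pre-trained layer exactly, and fine-tuning only has to update `rank * (d_in + d_out)` values instead of `d_in * d_out`. For a hypothetical 64 x 64 layer with rank 4, that is 512 trainable values against 4096 in the full weight.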

Taxonomy

Topics: Advanced Neural Network Applications · Data Stream Mining Techniques · Adversarial Robustness in Machine Learning