On-Device Training of Fully Quantized Deep Neural Networks on Cortex-M   Microcontrollers

Mark Deutel; Frank Hannig; Christopher Mutschler; and J\"urgen Teich

arXiv:2407.10734·cs.LG·October 24, 2024

On-Device Training of Fully Quantized Deep Neural Networks on Cortex-M Microcontrollers

Mark Deutel, Frank Hannig, Christopher Mutschler, and J\"urgen Teich

PDF

Open Access

TL;DR

This paper introduces a method for fully quantized on-device training of deep neural networks on Cortex-M microcontrollers, enabling model adaptation with limited resources by using in-place training and dynamic gradient updates.

Contribution

It presents a novel approach for efficient, fully quantized DNN training directly on MCUs, addressing resource constraints and demonstrating practical feasibility.

Findings

01

Training accuracy tradeoffs with resource constraints

02

Feasibility demonstrated on vision and time-series datasets

03

Insights into energy, memory, and latency tradeoffs

Abstract

On-device training of DNNs allows models to adapt and fine-tune to newly collected data or changing domains while deployed on microcontroller units (MCUs). However, DNN training is a resource-intensive task, making the implementation and execution of DNN training algorithms on MCUs challenging due to low processor speeds, constrained throughput, limited floating-point support, and memory constraints. In this work, we explore on-device training of DNNs for Cortex-M MCUs. We present a method that enables efficient training of DNNs completely in place on the MCU using fully quantized training (FQT) and dynamic partial gradient updates. We demonstrate the feasibility of our approach on multiple vision and time-series datasets and provide insights into the tradeoff between training accuracy, memory overhead, energy, and latency on real hardware.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Neural Networks and Reservoir Computing