Optimising TinyML with Quantization and Distillation of Transformer and   Mamba Models for Indoor Localisation on Edge Devices

Thanaphon Suwannaphong; Ferdian Jovan; Ian Craddock; Ryan; McConville

arXiv:2412.09289·cs.LG·December 13, 2024

Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices

Thanaphon Suwannaphong, Ferdian Jovan, Ian Craddock, Ryan, McConville

PDF

Open Access

TL;DR

This paper develops and evaluates small, efficient machine learning models for indoor localisation on edge devices, using quantization and distillation techniques to balance model size and accuracy within strict memory constraints.

Contribution

It introduces a framework combining quantization and knowledge distillation for transformer and Mamba models, enabling effective indoor localisation on ultra-low-power MCUs.

Findings

01

Quantized transformer achieves good accuracy within 64 KB RAM.

02

Mamba model performs well at 32 KB RAM without compression.

03

Framework enables deployment of advanced models on resource-limited devices.

Abstract

This paper proposes small and efficient machine learning models (TinyML) for resource-constrained edge devices, specifically for on-device indoor localisation. Typical approaches for indoor localisation rely on centralised remote processing of data transmitted from lower powered devices such as wearables. However, there are several benefits for moving this to the edge device itself, including increased battery life, enhanced privacy, reduced latency and lowered operational costs, all of which are key for common applications such as health monitoring. The work focuses on model compression techniques, including quantization and knowledge distillation, to significantly reduce the model size while maintaining high predictive performance. We base our work on a large state-of-the-art transformer-based model and seek to deploy it within low-power MCUs. We also propose a state-space-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Indoor and Outdoor Localization Technologies · Advanced Image and Video Retrieval Techniques

MethodsBalanced Selection · Mamba: Linear-Time Sequence Modeling with Selective State Spaces