Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices
Thanaphon Suwannaphong, Ferdian Jovan, Ian Craddock, Ryan, McConville

TL;DR
This paper develops and evaluates small, efficient machine learning models for indoor localisation on edge devices, using quantization and distillation techniques to balance model size and accuracy within strict memory constraints.
Contribution
It introduces a framework combining quantization and knowledge distillation for transformer and Mamba models, enabling effective indoor localisation on ultra-low-power MCUs.
Findings
Quantized transformer achieves good accuracy within 64 KB RAM.
Mamba model performs well at 32 KB RAM without compression.
Framework enables deployment of advanced models on resource-limited devices.
Abstract
This paper proposes small and efficient machine learning models (TinyML) for resource-constrained edge devices, specifically for on-device indoor localisation. Typical approaches for indoor localisation rely on centralised remote processing of data transmitted from lower powered devices such as wearables. However, there are several benefits for moving this to the edge device itself, including increased battery life, enhanced privacy, reduced latency and lowered operational costs, all of which are key for common applications such as health monitoring. The work focuses on model compression techniques, including quantization and knowledge distillation, to significantly reduce the model size while maintaining high predictive performance. We base our work on a large state-of-the-art transformer-based model and seek to deploy it within low-power MCUs. We also propose a state-space-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Indoor and Outdoor Localization Technologies · Advanced Image and Video Retrieval Techniques
MethodsBalanced Selection · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
