Automating Versatile Time-Series Analysis with Tiny Transformers on Embedded FPGAs
Tianheng Ling, Chao Qian, Lukas Johannes Ha{\ss}ler, Gregor Schiele

TL;DR
This paper introduces an automated framework for deploying tiny, quantized Transformer models on embedded FPGAs, enabling efficient time-series analysis across multiple tasks with minimal energy consumption and latency.
Contribution
It presents a fully automated deployment pipeline combining quantization, hardware-aware hyperparameter tuning, and VHDL generation for versatile FPGA-based Tiny Transformers.
Findings
Achieves as low as 0.033 mJ per inference on Spartan-7
Supports multiple time-series tasks including forecasting, classification, and anomaly detection
Demonstrates deployment feasibility on resource-constrained embedded FPGA platforms.
Abstract
Transformer-based models have shown strong performance across diverse time-series tasks, but their deployment on resource-constrained devices remains challenging due to high memory and computational demand. While prior work targeting Microcontroller Units (MCUs) has explored hardware-specific optimizations, such approaches are often task-specific and limited to 8-bit fixed-point precision. Field-Programmable Gate Arrays (FPGAs) offer greater flexibility, enabling fine-grained control over data precision and architecture. However, existing FPGA-based deployments of Transformers for time-series analysis typically focus on high-density platforms with manual configuration. This paper presents a unified and fully automated deployment framework for Tiny Transformers on embedded FPGAs. Our framework supports a compact encoder-only Transformer architecture across three representative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
