Automating Versatile Time-Series Analysis with Tiny Transformers on Embedded FPGAs

Tianheng Ling; Chao Qian; Lukas Johannes Ha{\ss}ler; Gregor Schiele

arXiv:2505.17662·cs.LG·September 22, 2025

Automating Versatile Time-Series Analysis with Tiny Transformers on Embedded FPGAs

Tianheng Ling, Chao Qian, Lukas Johannes Ha{\ss}ler, Gregor Schiele

PDF

TL;DR

This paper introduces an automated framework for deploying tiny, quantized Transformer models on embedded FPGAs, enabling efficient time-series analysis across multiple tasks with minimal energy consumption and latency.

Contribution

It presents a fully automated deployment pipeline combining quantization, hardware-aware hyperparameter tuning, and VHDL generation for versatile FPGA-based Tiny Transformers.

Findings

01

Achieves as low as 0.033 mJ per inference on Spartan-7

02

Supports multiple time-series tasks including forecasting, classification, and anomaly detection

03

Demonstrates deployment feasibility on resource-constrained embedded FPGA platforms.

Abstract

Transformer-based models have shown strong performance across diverse time-series tasks, but their deployment on resource-constrained devices remains challenging due to high memory and computational demand. While prior work targeting Microcontroller Units (MCUs) has explored hardware-specific optimizations, such approaches are often task-specific and limited to 8-bit fixed-point precision. Field-Programmable Gate Arrays (FPGAs) offer greater flexibility, enabling fine-grained control over data precision and architecture. However, existing FPGA-based deployments of Transformers for time-series analysis typically focus on high-density platforms with manual configuration. This paper presents a unified and fully automated deployment framework for Tiny Transformers on embedded FPGAs. Our framework supports a compact encoder-only Transformer architecture across three representative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.