Position: The Need for Ultrafast Training
Duc Hoang

TL;DR
This paper advocates for developing ultrafast on-chip training for FPGAs to enable real-time learning and adaptation in high-frequency, non-stationary environments, expanding FPGA applications beyond static inference.
Contribution
It introduces the concept of integrating training directly into FPGA fabric, proposing a paradigm shift from inference-only accelerators to real-time learning systems.
Findings
Highlights the limitations of current static inference accelerators.
Proposes a new architecture for ultrafast on-chip training.
Discusses potential applications in scientific and industrial domains.
Abstract
Domain-specialized FPGAs have delivered unprecedented performance for low-latency inference across scientific and industrial workloads, yet nearly all existing accelerators assume static models trained offline, relegating learning and adaptation to slower CPUs or GPUs. This separation fundamentally limits systems that must operate in non-stationary, high-frequency environments, where model updates must occur at the timescale of the underlying physics. In this paper, I argue for a shift from inference-only accelerators to ultrafast on-chip learning, in which both inference and training execute directly within the FPGA fabric under deterministic, sub-microsecond latency constraints. Bringing learning into the same real-time datapath as inference would enable closed-loop systems that adapt as fast as the physical processes they control, with applications spanning quantum error correction,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Advanced Memory and Neural Computing
