ASRPU: A Programmable Accelerator for Low-Power Automatic Speech Recognition
Dennis Pinto, Jose-Mar\'ia Arnau, Antonio Gonz\'alez

TL;DR
This paper introduces ASRPU, a programmable hardware accelerator designed for low-power, real-time automatic speech recognition on edge devices, balancing flexibility and efficiency.
Contribution
The paper presents ASRPU, a novel programmable accelerator with general-purpose cores that simplifies on-edge ASR deployment while maintaining adaptability to evolving models.
Findings
Achieves real-time decoding on edge devices
Operates with very low power consumption
Supports flexible programming for different ASR models
Abstract
The outstanding accuracy achieved by modern Automatic Speech Recognition (ASR) systems is enabling them to quickly become a mainstream technology. ASR is essential for many applications, such as speech-based assistants, dictation systems and real-time language translation. However, highly accurate ASR systems are computationally expensive, requiring on the order of billions of arithmetic operations to decode each second of audio, which conflicts with a growing interest in deploying ASR on edge devices. On these devices, hardware acceleration is key for achieving acceptable performance. However, ASR is a rich and fast-changing field, and thus, any overly specialized hardware accelerator may quickly become obsolete. In this paper, we tackle those challenges by proposing ASRPU, a programmable accelerator for on-edge ASR. ASRPU contains a pool of general-purpose cores that execute small…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Speech and dialogue systems
