TsetlinKWS: A 65nm 16.58uW, 0.63mm2 State-Driven Convolutional Tsetlin Machine-Based Accelerator For Keyword Spotting
Baizhou Lin, Yuetong Fang, Renjing Xu, Rishad Shafik, Jagmohan Chauhan

TL;DR
This paper introduces TsetlinKWS, a low-power, efficient convolutional Tsetlin Machine-based accelerator for keyword spotting, achieving high accuracy and significant energy savings in speech recognition tasks.
Contribution
It presents a novel co-design framework combining feature extraction, sparse model compression, and a specialized architecture for CTMs, enabling ultra-low-power speech keyword spotting.
Findings
Achieved 87.35% accuracy on 12-keyword spotting.
Reduced model size by 9.84× using OG-BCSR algorithm.
Consumed only 16.58 μW in 65 nm technology, with 10× fewer logic operations.
Abstract
The Tsetlin Machine (TM) has recently attracted attention as a low-power alternative to neural networks due to its simple and interpretable inference mechanisms. However, its performance on speech-related tasks remains limited. This paper proposes TsetlinKWS, the first algorithm-hardware co-design framework for the Convolutional Tsetlin Machine (CTM) on the 12-keyword spotting task. Firstly, we introduce a novel Mel-Frequency Spectral Coefficient and Spectral Flux (MFSC-SF) feature extraction scheme together with spectral convolution, enabling the CTM to reach its first-ever competitive accuracy of 87.35% on the 12-keyword spotting task. Secondly, we develop an Optimized Grouped Block-Compressed Sparse Row (OG-BCSR) algorithm that achieves a remarkable 9.84 reduction in model size, significantly improving the storage efficiency on CTMs. Finally, we propose a state-driven…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
