DPQ-HD: Post-Training Compression for Ultra-Low Power Hyperdimensional Computing
Nilesh Prasad Pandey, Shriniwas Kulkarni, David Wang, Onat Gungor,, Flavio Ponzina, Tajana Rosing

TL;DR
DPQ-HD introduces a post-training compression method for hyperdimensional computing that significantly reduces memory and computation while maintaining high accuracy, enabling ultra-low power edge AI applications.
Contribution
It proposes a novel DPQ-HD algorithm combining decomposition, pruning, and quantization for post-training compression without retraining, and an energy-efficient inference method with early exit.
Findings
Achieves 20-100x memory reduction with 1-2% accuracy loss
Outperforms existing post-training compression methods
Enables faster inference and less optimization time
Abstract
Hyperdimensional Computing (HDC) is emerging as a promising approach for edge AI, offering a balance between accuracy and efficiency. However, current HDC-based applications often rely on high-precision models and/or encoding matrices to achieve competitive performance, which imposes significant computational and memory demands, especially for ultra-low power devices. While recent efforts use techniques like precision reduction and pruning to increase the efficiency, most require retraining to maintain performance, making them expensive and impractical. To address this issue, we propose a novel Post Training Compression algorithm, Decomposition-Pruning-Quantization (DPQ-HD), which aims at compressing the end-to-end HDC system, achieving near floating point performance without the need of retraining. DPQ-HD reduces computational and memory overhead by uniquely combining the above three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Parallel Computing and Optimization Techniques · Neural Networks and Reservoir Computing
MethodsPruning
