DPQ-HD: Post-Training Compression for Ultra-Low Power Hyperdimensional   Computing

Nilesh Prasad Pandey; Shriniwas Kulkarni; David Wang; Onat Gungor,; Flavio Ponzina; Tajana Rosing

arXiv:2505.05413·cs.LG·May 9, 2025

DPQ-HD: Post-Training Compression for Ultra-Low Power Hyperdimensional Computing

Nilesh Prasad Pandey, Shriniwas Kulkarni, David Wang, Onat Gungor,, Flavio Ponzina, Tajana Rosing

PDF

Open Access

TL;DR

DPQ-HD introduces a post-training compression method for hyperdimensional computing that significantly reduces memory and computation while maintaining high accuracy, enabling ultra-low power edge AI applications.

Contribution

It proposes a novel DPQ-HD algorithm combining decomposition, pruning, and quantization for post-training compression without retraining, and an energy-efficient inference method with early exit.

Findings

01

Achieves 20-100x memory reduction with 1-2% accuracy loss

02

Outperforms existing post-training compression methods

03

Enables faster inference and less optimization time

Abstract

Hyperdimensional Computing (HDC) is emerging as a promising approach for edge AI, offering a balance between accuracy and efficiency. However, current HDC-based applications often rely on high-precision models and/or encoding matrices to achieve competitive performance, which imposes significant computational and memory demands, especially for ultra-low power devices. While recent efforts use techniques like precision reduction and pruning to increase the efficiency, most require retraining to maintain performance, making them expensive and impractical. To address this issue, we propose a novel Post Training Compression algorithm, Decomposition-Pruning-Quantization (DPQ-HD), which aims at compressing the end-to-end HDC system, achieving near floating point performance without the need of retraining. DPQ-HD reduces computational and memory overhead by uniquely combining the above three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Parallel Computing and Optimization Techniques · Neural Networks and Reservoir Computing

MethodsPruning