Semantic Compression for Word and Sentence Embeddings using Discrete Wavelet Transform
Rana Aref Salama, Abdou Youssef, Mona Diab

TL;DR
This paper demonstrates that applying Discrete Wavelet Transforms to word and sentence embeddings can significantly compress their size while preserving semantic quality, enhancing NLP efficiency and performance.
Contribution
It introduces a novel application of DWT to NLP embeddings, showing effective compression and improved downstream task accuracy.
Findings
DWT reduces embedding dimensionality by up to 93%.
DWT maintains semantic similarity performance.
DWT improves accuracy in downstream NLP tasks.
Abstract
Wavelet transforms, a powerful mathematical tool, have been widely used in different domains, including Signal and Image processing, to unravel intricate patterns, enhance data representation, and extract meaningful features from data. Tangible results from their application suggest that Wavelet transforms can be applied to NLP capturing a variety of linguistic and semantic properties. In this paper, we empirically leverage the application of Discrete Wavelet Transforms (DWT) to word and sentence embeddings. We aim to showcase the capabilities of DWT in analyzing embedding representations at different levels of resolution and compressing them while maintaining their overall quality. We assess the effectiveness of DWT embeddings on semantic similarity tasks to show how DWT can be used to consolidate important semantic information in an embedding vector. We show the efficacy of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Handwritten Text Recognition Techniques · Advanced Graph Neural Networks
