Hash Embeddings for Efficient Word Representations
Dan Svenstrup, Jonas Meinertz Hansen, Ole Winther

TL;DR
Hash embeddings offer an efficient way to represent words in vector form, handling large vocabularies without prior dictionary creation, and match or surpass regular embeddings in performance with fewer parameters.
Contribution
The paper introduces hash embeddings, a novel method combining hashing and embedding techniques to improve efficiency and scalability in word representation models.
Findings
Handle vocabularies with millions of tokens
Require fewer parameters than regular embeddings
Achieve comparable or better performance across tasks
Abstract
We present hash embeddings, an efficient method for representing words in a continuous vector form. A hash embedding may be seen as an interpolation between a standard word embedding and a word embedding created using a random hash function (the hashing trick). In hash embeddings each token is represented by -dimensional embeddings vectors and one dimensional weight vector. The final dimensional representation of the token is the product of the two. Rather than fitting the embedding vectors for each token these are selected by the hashing trick from a shared pool of embedding vectors. Our experiments show that hash embeddings can easily deal with huge vocabularies consisting of millions of tokens. When using a hash embedding there is no need to create a dictionary before training nor to perform any kind of vocabulary pruning after training. We show that models trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Image and Video Retrieval Techniques · Algorithms and Data Compression
