Hash Embeddings for Efficient Word Representations

Dan Svenstrup; Jonas Meinertz Hansen; Ole Winther

arXiv:1709.03933·cs.CL·September 13, 2017·32 cites

Hash Embeddings for Efficient Word Representations

Dan Svenstrup, Jonas Meinertz Hansen, Ole Winther

PDF

Open Access

TL;DR

Hash embeddings offer an efficient way to represent words in vector form, handling large vocabularies without prior dictionary creation, and match or surpass regular embeddings in performance with fewer parameters.

Contribution

The paper introduces hash embeddings, a novel method combining hashing and embedding techniques to improve efficiency and scalability in word representation models.

Findings

01

Handle vocabularies with millions of tokens

02

Require fewer parameters than regular embeddings

03

Achieve comparable or better performance across tasks

Abstract

We present hash embeddings, an efficient method for representing words in a continuous vector form. A hash embedding may be seen as an interpolation between a standard word embedding and a word embedding created using a random hash function (the hashing trick). In hash embeddings each token is represented by $k$ $d$ -dimensional embeddings vectors and one $k$ dimensional weight vector. The final $d$ dimensional representation of the token is the product of the two. Rather than fitting the embedding vectors for each token these are selected by the hashing trick from a shared pool of $B$ embedding vectors. Our experiments show that hash embeddings can easily deal with huge vocabularies consisting of millions of tokens. When using a hash embedding there is no need to create a dictionary before training nor to perform any kind of vocabulary pruning after training. We show that models trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Image and Video Retrieval Techniques · Algorithms and Data Compression