Getting deep recommenders fit: Bloom embeddings for sparse binary input/output networks
Joan Serr\`a, Alexandros Karatzoglou

TL;DR
Bloom embeddings offer an efficient compression method for deep learning recommendation systems with high-dimensional sparse data, reducing size and training time while maintaining or improving accuracy.
Contribution
The paper introduces Bloom embeddings, a novel compression technique for neural networks handling sparse high-dimensional binary data, enhancing efficiency without altering core models.
Findings
Achieves up to 1/5 compression with minimal accuracy loss
In some cases, improves accuracy by up to 12%
Outperforms four alternative methods on seven datasets
Abstract
Recommendation algorithms that incorporate techniques from deep learning are becoming increasingly popular. Due to the structure of the data coming from recommendation domains (i.e., one-hot-encoded vectors of item preferences), these algorithms tend to have large input and output dimensionalities that dominate their overall size. This makes them difficult to train, due to the limited memory of graphical processing units, and difficult to deploy on mobile devices with limited hardware. To address these difficulties, we propose Bloom embeddings, a compression technique that can be applied to the input and output of neural network models dealing with sparse high-dimensional binary-coded instances. Bloom embeddings are computationally efficient, and do not seriously compromise the accuracy of the model up to 1/5 compression ratios. In some cases, they even improve over the original…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Caching and Content Delivery · Stochastic Gradient Optimization Techniques
