TL;DR
This paper introduces a novel projected Hamming dissimilarity measure that incorporates binary importance weighting of hash codes, improving collaborative filtering performance without extra storage or computational cost.
Contribution
It proposes a new dissimilarity measure and a variational hashing model optimized for it, enhancing the effectiveness of hash-based collaborative filtering methods.
Findings
Up to +7% in NDCG performance
Up to +14% in MRR performance
No additional storage or computational overhead
Abstract
When reasoning about tasks that involve large amounts of data, a common approach is to represent data items as objects in the Hamming space where operations can be done efficiently and effectively. Object similarity can then be computed by learning binary representations (hash codes) of the objects and computing their Hamming distance. While this is highly efficient, each bit dimension is equally weighted, which means that potentially discriminative information of the data is lost. A more expressive alternative is to use real-valued vector representations and compute their inner product; this allows varying the weight of each dimension but is many magnitudes slower. To fix this, we derive a new way of measuring the dissimilarity between two objects in the Hamming space with binary weighting of each dimension (i.e., disabling bits): we consider a field-agnostic dissimilarity that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
