Tuning Word2vec for Large Scale Recommendation Systems

Benjamin P. Chamberlain; Emanuele Rossi; Dan Shiebler; Suvash Sedhain; Michael M. Bronstein

arXiv:2009.12192·cs.IR·September 28, 2020

TL;DR

Word2vec's default hyperparameters are poorly suited to recommender systems. Tuning them raises hit rate by an average of 221% when the search is unconstrained and by 138% under a runtime budget, and the budget-constrained search remains feasible for large scale recommendation tasks.

Contribution

The paper shows that runtime budget-constrained hyperparameter optimization keeps much of the gain from unconstrained tuning of Word2vec while avoiding hyperparameter settings that are too expensive for large scale recommendation tasks.

Findings

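1. Unconstrained hyperparameter optimization yields an average 221% improvement in hit rate over the default parameters.

2. Runtime budget-constrained optimization yields an average 138% improvement in hit rate.

3. Hyperparameters tuned on a sample of the data yield a 91% improvement in hit rate when applied to the full datasets.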
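
To make the percentages above concrete, here is a minimal sketch of how a hit rate and a relative improvement could be computed. It assumes the common hit rate@k definition (the share of test cases whose held-out item appears in the top-k recommendations); this page does not spell out the paper's exact evaluation protocol, and the function names and sample numbers are hypothetical.

```python
from typing import Hashable, Sequence


def hit_rate_at_k(
    recommendations: Sequence[Sequence[Hashable]],
    held_out: Sequence[Hashable],
    k: int = 10,
) -> float:
    """Share of test cases whose held-out item is in the top-k list.

    Assumed definition for illustration; not taken from the paper.
    """
    if len(recommendations) != len(held_out):
        raise ValueError("need one recommendation list per held-out item")
    if not held_out:
        raise ValueError("need at least one test case")
    hits = sum(
        1 for recs, target in zip(recommendations, held_out) if target in recs[:k]
    )
    return hits / len(held_out)


def relative_improvement_pct(baseline: float, tuned: float) -> float:
    """Percentage improvement of the tuned score over the baseline score."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return 100.0 * (tuned - baseline) / baseline


# Hypothetical numbers: a baseline hit rate of 0.05 rising to 0.1605
# corresponds to a 221% improvement, i.e. 3.21 times the baseline.
improvement = relative_improvement_pct(0.05, 0.1605)
```

Read this way, a 221% improvement means the tuned model's hit rate is a little over three times the default model's, not 2.21 times.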

Abstract

Word2vec is a powerful machine learning tool that emerged from Natural Language Processing (NLP) and is now applied in multiple domains, including recommender systems, forecasting, and network analysis. As Word2vec is often used off the shelf, we address the question of whether the default hyperparameters are suitable for recommender systems. The answer is emphatically no. In this paper, we first elucidate the importance of hyperparameter optimization and show that unconstrained optimization yields an average 221% improvement in hit rate over the default parameters. However, unconstrained optimization leads to hyperparameter settings that are very expensive and not feasible for large scale recommendation tasks. To this end, we demonstrate 138% average improvement in hit rate with a runtime budget-constrained hyperparameter optimization. Furthermore, to make hyperparameter optimization…
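
The idea of a runtime budget-constrained search can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: plain random search is swapped in as a stand-in optimizer, the search space values are made up, the hyperparameter names are borrowed from gensim's `Word2Vec` constructor, and `train_and_score` is a hypothetical callable that trains a model with the given settings and returns its hit rate.

```python
import random
import time
from typing import Callable, Dict, Optional, Tuple

Params = Dict[str, int]

# Illustrative search space (assumed values, not the paper's).
SEARCH_SPACE = {
    "vector_size": [32, 64, 128, 256],
    "window": [2, 5, 10, 20],
    "negative": [5, 10, 20],
    "epochs": [5, 20, 50],
}


def constrained_random_search(
    train_and_score: Callable[[Params], float],
    budget_seconds: float,
    n_trials: int = 20,
    seed: int = 0,
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[Params], float]:
    """Random search that keeps only trials finishing within the budget.

    `train_and_score` is a hypothetical user-supplied function; `clock`
    is injectable so the budget logic can be exercised without real training.
    """
    rng = random.Random(seed)
    best_params: Optional[Params] = None
    best_score = float("-inf")
    for _ in range(n_trials):
        params = {name: rng.choice(values) for name, values in SEARCH_SPACE.items()}
        start = clock()
        score = train_and_score(params)
        elapsed = clock() - start
        if elapsed > budget_seconds:
            continue  # too slow: infeasible under the runtime budget
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score
```

A setting that scores well but trains too slowly is discarded, so the returned configuration is the best among those that fit the budget. If no trial fits, the function returns `(None, -inf)`.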
