Low Rank Field-Weighted Factorization Machines for Low Latency Item Recommendation

Alex Shtoff; Michael Viderman; Naama Haramaty-Krasne; Oren Somekh; Ariel Raviv; Tularam Ban

arXiv:2408.00801·cs.IR·August 5, 2024

TL;DR

This paper introduces a low-rank field-weighted factorization machine approach that significantly reduces inference cost in recommendation systems by focusing on item fields, outperforming heuristic pruning methods in speed and accuracy.

Contribution

The authors propose a low-rank decomposition method for FwFMs that decreases inference complexity from quadratic to linear in the number of fields, enhancing efficiency in low-latency systems.
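The quadratic-to-linear reduction can be illustrated with a small NumPy sketch. It assumes the field interaction matrix is symmetric and factorizes as R = Uᵀ diag(d) U with rank ρ; this parametrization, the function names, and the array shapes are illustrative assumptions, not necessarily the authors' exact formulation. Under that assumption, the weighted pairwise sum collapses to ρ projected vectors, so the cost grows linearly in the number of fields.

```python
import numpy as np

def fwfm_naive(V, R):
    """FwFM pairwise term: sum over i < j of R[i, j] * <v_i, v_j>.
    V has shape (m, k): one embedding per field. Quadratic in m."""
    m = V.shape[0]
    return sum(R[i, j] * float(V[i] @ V[j])
               for i in range(m) for j in range(i + 1, m))

def fwfm_low_rank(V, U, d):
    """Same quantity, assuming R = U.T @ diag(d) @ U with U of shape (rho, m).
    Cost is O(rho * m * k), i.e. linear in the number of fields m."""
    P = U @ V                                   # (rho, k) projected embeddings
    full = float(d @ (P * P).sum(axis=1))       # sum over all (i, j) of R_ij <v_i, v_j>
    diag_R = (U * U * d[:, None]).sum(axis=0)   # R_ii for every field
    diag = float(diag_R @ (V * V).sum(axis=1))  # the i == j terms to subtract
    return 0.5 * (full - diag)                  # R symmetric, so halve to keep i < j

# Hypothetical sizes: 8 fields, embedding dimension 4, rank 2.
rng = np.random.default_rng(0)
m, k, rho = 8, 4, 2
U = rng.normal(size=(rho, m))
d = rng.normal(size=rho)
V = rng.normal(size=(m, k))
R = U.T @ np.diag(d) @ U

naive_value = fwfm_naive(V, R)
fast_value = fwfm_low_rank(V, U, d)
```

Because `P = U @ V` is a sum over fields, the contribution of the user and context fields to `P` could be computed once per query and reused for every candidate item, which is the property the paper is after.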

Findings

01  Aggressive rank reduction outperforms pruning in accuracy and speed.

02  The method achieves faster inference in real-world online advertising systems.

03  Experimental results confirm the effectiveness of the low-rank approach.

Abstract

Factorization machine (FM) variants are widely used in recommendation systems that operate under strict throughput and latency requirements, such as online advertising systems. FMs are known both for their ability to model pairwise feature interactions while being resilient to data sparsity, and for their computational graphs, which facilitate fast inference and training. Moreover, when items are ranked as part of a query for each incoming user, these graphs make it possible to compute the portion stemming from the user and context fields only once per query. Consequently, in terms of inference cost, the number of user or context fields is practically unlimited. More advanced FM variants, such as FwFM, provide better accuracy by learning a representation of field-wise interactions, but require computing all pairwise interaction terms explicitly. The computational cost during inference is…
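The "once per query" property of plain FMs can be sketched as follows. This is a minimal NumPy illustration of the standard FM identity, with hypothetical function names and shapes; it covers only the pairwise term and omits the bias and linear terms. The sum of context embeddings and the sum of their squared norms are computed once, and each candidate item then adds only its own fields.

```python
import numpy as np

def fm_pairwise_naive(V):
    """Sum of <v_i, v_j> over all pairs i < j; quadratic in the number of fields."""
    m = V.shape[0]
    return sum(float(V[i] @ V[j]) for i in range(m) for j in range(i + 1, m))

def fm_pairwise_fast(V):
    """Same value via 0.5 * (||sum_i v_i||^2 - sum_i ||v_i||^2); linear in fields."""
    s = V.sum(axis=0)
    return 0.5 * float(s @ s - (V * V).sum())

def score_items(context_V, item_Vs):
    """Score many items for one query, reusing the user/context statistics."""
    ctx_sum = context_V.sum(axis=0)                  # computed once per query
    ctx_sq = float((context_V * context_V).sum())    # computed once per query
    scores = []
    for item_V in item_Vs:
        s = ctx_sum + item_V.sum(axis=0)             # per-item cost depends on item fields only
        sq = ctx_sq + float((item_V * item_V).sum())
        scores.append(0.5 * float(s @ s - sq))
    return scores

# Hypothetical query: 6 user/context fields, 5 candidate items with 3 fields each.
rng = np.random.default_rng(0)
context_V = rng.normal(size=(6, 4))
item_Vs = [rng.normal(size=(3, 4)) for _ in range(5)]
scores = score_items(context_V, item_Vs)
```

In FwFM the pairs are weighted by a field interaction matrix, so this identity no longer applies directly, which is why the abstract says all pairwise terms must be computed explicitly.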

Code & Models

Repositories

michaelviderman/pytorch-fm
pytorch · Official

Taxonomy

Topics: Recommender Systems and Techniques · Text and Document Classification Technologies · Face and Expression Recognition

Methods: Sparse Evolutionary Training · Pruning