Low Rank Field-Weighted Factorization Machines for Low Latency Item Recommendation
Alex Shtoff, Michael Viderman, Naama Haramaty-Krasne, Oren Somekh,, Ariel Raviv, Tularam Ban

TL;DR
This paper introduces a low-rank field-weighted factorization machine approach that significantly reduces inference cost in recommendation systems by focusing on item fields, outperforming heuristic pruning methods in speed and accuracy.
Contribution
The authors propose a low-rank decomposition method for FwFMs that decreases inference complexity from quadratic to linear in the number of fields, enhancing efficiency in low-latency systems.
Findings
Aggressive rank reduction outperforms pruning in accuracy and speed.
The method achieves faster inference in real-world online advertising systems.
Experimental results confirm the effectiveness of the low-rank approach.
Abstract
Factorization machine (FM) variants are widely used in recommendation systems that operate under strict throughput and latency requirements, such as online advertising systems. FMs are known both due to their ability to model pairwise feature interactions while being resilient to data sparsity, and their computational graphs that facilitate fast inference and training. Moreover, when items are ranked as a part of a query for each incoming user, these graphs facilitate computing the portion stemming from the user and context fields only once per query. Consequently, in terms of inference cost, the number of user or context fields is practically unlimited. More advanced FM variants, such as FwFM, provide better accuracy by learning a representation of field-wise interactions, but require computing all pairwise interaction terms explicitly. The computational cost during inference is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Text and Document Classification Technologies · Face and Expression Recognition
MethodsSparse Evolutionary Training · Pruning
