TL;DR
RePlay is an open-source, scalable recommendation framework that streamlines transitioning from research to production by supporting multiple data processing stacks and enabling consistent, efficient model comparison.
Contribution
RePlay introduces an end-to-end, production-ready toolkit for recommender systems that supports flexible data processing and scalable deployment, bridging the gap between research and production.
Findings
Supports multiple data stacks: Pandas, Polars, Spark
Enables scalable computation and deployment
Facilitates consistent model comparison in production
Abstract
Using a single tool to build and compare recommender systems significantly reduces the time to market for new models. In addition, the comparison results when using such tools look more consistent. This is why many different tools and libraries for researchers in the field of recommendations have recently appeared. Unfortunately, most of these frameworks are aimed primarily at researchers and require modification for use in production due to the inability to work on large datasets or an inappropriate architecture. In this demo, we present our open-source toolkit RePlay - a framework containing an end-to-end pipeline for building recommender systems, which is ready for production use. RePlay also allows you to use a suitable stack for the pipeline on each stage: Pandas, Polars, or Spark. This allows the library to scale computations and deploy to a cluster. Thus, RePlay allows data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLib
