TL;DR
This paper introduces a framework for learning backward compatible embeddings that enable frequent embedding updates without requiring consumer models to retrain, demonstrated through six methods evaluated on real-world recommender systems.
Contribution
We formalize the problem of embedding version updates with backward compatibility and propose BC-Aligner, a method that maintains compatibility while preserving task performance.
Findings
BC-Aligner maintains backward compatibility after multiple updates
BC-Aligner achieves similar task performance to specialized embedding models
Six methods were systematically evaluated on real-world data
Abstract
Embeddings, low-dimensional vector representation of objects, are fundamental in building modern machine learning systems. In industrial settings, there is usually an embedding team that trains an embedding model to solve intended tasks (e.g., product recommendation). The produced embeddings are then widely consumed by consumer teams to solve their unintended tasks (e.g., fraud detection). However, as the embedding model gets updated and retrained to improve performance on the intended task, the newly-generated embeddings are no longer compatible with the existing consumer models. This means that historical versions of the embeddings can never be retired or all consumer teams have to retrain their models to make them compatible with the latest version of the embeddings, both of which are extremely costly in practice. Here we study the problem of embedding version updates and their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
