Learning Backward Compatible Embeddings

Weihua Hu; Rajas Bansal; Kaidi Cao; Nikhil Rao; Karthik Subbian; Jure; Leskovec

arXiv:2206.03040·stat.ML·June 8, 2022

Learning Backward Compatible Embeddings

Weihua Hu, Rajas Bansal, Kaidi Cao, Nikhil Rao, Karthik Subbian, Jure, Leskovec

PDF

1 Repo

TL;DR

This paper introduces a framework for learning backward compatible embeddings that enable frequent embedding updates without requiring consumer models to retrain, demonstrated through six methods evaluated on real-world recommender systems.

Contribution

We formalize the problem of embedding version updates with backward compatibility and propose BC-Aligner, a method that maintains compatibility while preserving task performance.

Findings

01

BC-Aligner maintains backward compatibility after multiple updates

02

BC-Aligner achieves similar task performance to specialized embedding models

03

Six methods were systematically evaluated on real-world data

Abstract

Embeddings, low-dimensional vector representation of objects, are fundamental in building modern machine learning systems. In industrial settings, there is usually an embedding team that trains an embedding model to solve intended tasks (e.g., product recommendation). The produced embeddings are then widely consumed by consumer teams to solve their unintended tasks (e.g., fraud detection). However, as the embedding model gets updated and retrained to improve performance on the intended task, the newly-generated embeddings are no longer compatible with the existing consumer models. This means that historical versions of the embeddings can never be retired or all consumer teams have to retrain their models to make them compatible with the latest version of the embeddings, both of which are extremely costly in practice. Here we study the problem of embedding version updates and their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

snap-stanford/bc-emb
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.