Balanced Co-Clustering of Users and Items for Embedding Table Compression in Recommender Systems
Runhao Jiang, Renchi Yang, Donghao Wu

TL;DR
BACO is a novel framework that compresses embedding tables in recommender systems by exploiting user-item interaction signals for balanced co-clustering, significantly reducing parameters with minimal accuracy loss.
Contribution
This paper introduces BACO, a fast, theoretically grounded co-clustering method that effectively compresses embeddings while maintaining high recommendation accuracy.
Findings
Reduces embedding parameters by over 75%.
Achieves up to 346X faster performance than baselines.
Maintains recall drop within 1.85% compared to full models.
Abstract
Recommender systems have advanced markedly over the past decade by transforming each user/item into a dense embedding vector with deep learning models. At industrial scale, embedding tables constituted by such vectors of all users/items demand a vast amount of parameters and impose heavy compute and memory overhead during training and inference, hindering model deployment under resource constraints. Existing solutions towards embedding compression either suffer from severely compromised recommendation accuracy or incur considerable computational costs. To mitigate these issues, this paper presents BACO, a fast and effective framework for compressing embedding tables. Unlike traditional ID hashing, BACO is built on the idea of exploiting collaborative signals in user-item interactions for user and item groupings, such that similar users/items share the same embeddings in the codebook.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
