UniMixer: A Unified Architecture for Scaling Laws in Recommendation Systems
Mingming Ha, Guanchen Wang, Linxun Chen, Xuan Rao, Yuexin Shi, Tianbao Ma, Zhaojie Liu, Yunqian Fan, Zilong Lu, Yanan Niu, Han Li, Kun Gai

TL;DR
UniMixer introduces a unified architecture for recommendation models that unifies mainstream scaling blocks, optimizes token mixing, and enhances scaling efficiency through a lightweight module, validated by extensive experiments.
Contribution
The paper proposes UniMixer, a unified framework that unifies different recommendation model architectures and introduces a learnable token mixing module for improved scaling.
Findings
UniMixer achieves superior scaling performance in recommendation systems.
The UniMixing-Lite module reduces parameters and computational cost significantly.
Extensive experiments verify the effectiveness of UniMixer in both offline and online settings.
Abstract
In recent years, the scaling laws of recommendation models have attracted increasing attention, which govern the relationship between performance and parameters/FLOPs of recommenders. Currently, there are three mainstream architectures for achieving scaling in recommendation models, namely attention-based, TokenMixer-based, and factorization-machine-based methods, which exhibit fundamental differences in both design philosophy and architectural structure. In this paper, we propose a unified scaling architecture for recommendation systems, namely \textbf{UniMixer}, to improve scaling efficiency and establish a unified theoretical framework that unifies the mainstream scaling blocks. By transforming the rule-based TokenMixer to an equivalent parameterized structure, we construct a generalized parameterized feature mixing module that allows the token mixing patterns to be optimized and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
