MTGR: Industrial-Scale Generative Recommendation Framework in Meituan
Ruidong Han, Bin Yin, Shangyu Chen, He Jiang, Fei Jiang, Xiang Li, Chi Ma, Mincong Huang, Xiaoguang Li, Chunzhen Jing, Yueming Han, Menglei Zhou, Lei Yu, Chuan Liu, Wei Lin

TL;DR
MTGR is a scalable generative recommendation framework that retains traditional model features, accelerates training and inference, and has been successfully deployed at Meituan, significantly improving efficiency and performance.
Contribution
The paper introduces MTGR, a novel recommendation framework that preserves cross features, accelerates training and inference, and supports large-scale deployment in industrial settings.
Findings
Achieved 65x FLOPs reduction for inference compared to DLRM.
Supported models with 10 to 100 times the computational complexity of DLRM.
Successfully deployed on Meituan's platform, handling main traffic.
Abstract
Scaling law has been extensively validated in many domains such as natural language processing and computer vision. In the recommendation system, recent work has adopted generative recommendations to achieve scalability, but their generative approaches require abandoning the carefully constructed cross features of traditional recommendation models. We found that this approach significantly degrades model performance, and scaling up cannot compensate for it at all. In this paper, we propose MTGR (Meituan Generative Recommendation) to address this issue. MTGR is modeling based on the HSTU architecture and can retain the original deep learning recommendation model (DLRM) features, including cross features. Additionally, MTGR achieves training and inference acceleration through user-level compression to ensure efficient scaling. We also propose Group-Layer Normalization (GLN) to enhance the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
