MTGR: Industrial-Scale Generative Recommendation Framework in Meituan

Ruidong Han; Bin Yin; Shangyu Chen; He Jiang; Fei Jiang; Xiang Li; Chi Ma; Mincong Huang; Xiaoguang Li; Chunzhen Jing; Yueming Han; Menglei Zhou; Lei Yu; Chuan Liu; Wei Lin

arXiv:2505.18654·cs.IR·August 25, 2025

MTGR: Industrial-Scale Generative Recommendation Framework in Meituan

Ruidong Han, Bin Yin, Shangyu Chen, He Jiang, Fei Jiang, Xiang Li, Chi Ma, Mincong Huang, Xiaoguang Li, Chunzhen Jing, Yueming Han, Menglei Zhou, Lei Yu, Chuan Liu, Wei Lin

PDF

TL;DR

MTGR is a scalable generative recommendation framework that retains traditional model features, accelerates training and inference, and has been successfully deployed at Meituan, significantly improving efficiency and performance.

Contribution

The paper introduces MTGR, a novel recommendation framework that preserves cross features, accelerates training and inference, and supports large-scale deployment in industrial settings.

Findings

01

Achieved 65x FLOPs reduction for inference compared to DLRM.

02

Supported models with 10 to 100 times the computational complexity of DLRM.

03

Successfully deployed on Meituan's platform, handling main traffic.

Abstract

Scaling law has been extensively validated in many domains such as natural language processing and computer vision. In the recommendation system, recent work has adopted generative recommendations to achieve scalability, but their generative approaches require abandoning the carefully constructed cross features of traditional recommendation models. We found that this approach significantly degrades model performance, and scaling up cannot compensate for it at all. In this paper, we propose MTGR (Meituan Generative Recommendation) to address this issue. MTGR is modeling based on the HSTU architecture and can retain the original deep learning recommendation model (DLRM) features, including cross features. Additionally, MTGR achieves training and inference acceleration through user-level compression to ensure efficient scaling. We also propose Group-Layer Normalization (GLN) to enhance the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.