RQ-GMM: Residual Quantized Gaussian Mixture Model for Multimodal Semantic Discretization in CTR Prediction
Ziye Tong, Jiahao Liu, Weimin Zhang, Hongji Ruan, Derick Tang, Zhanpeng Zeng, Qinsong Zeng, Peng Zhang, Tun Lu, Ning Gu

TL;DR
This paper introduces RQ-GMM, a probabilistic residual quantization method that improves semantic discretization of multimodal embeddings, leading to better CTR prediction and significant online performance gains.
Contribution
The paper presents RQ-GMM, a novel probabilistic residual quantization approach using Gaussian Mixture Models for improved semantic discretization in CTR prediction.
Findings
Achieves higher codebook utilization and reconstruction accuracy.
Demonstrates a 1.502% increase in Advertiser Value in online A/B tests.
Successfully deployed for large-scale daily recommendations.
Abstract
Multimodal content is crucial for click-through rate (CTR) prediction. However, directly incorporating continuous embeddings from pre-trained models into CTR models yields suboptimal results due to misaligned optimization objectives and convergence speed inconsistency during joint training. Discretizing embeddings into semantic IDs before feeding them into CTR models offers a more effective solution, yet existing methods suffer from limited codebook utilization, reconstruction accuracy, and semantic discriminability. We propose RQ-GMM (Residual Quantized Gaussian Mixture Model), which introduces probabilistic modeling to better capture the statistical structure of multimodal embedding spaces. Through Gaussian Mixture Models combined with residual quantization, RQ-GMM achieves superior codebook utilization and reconstruction accuracy. Experiments on public datasets and online A/B tests…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Recommender Systems and Techniques · Consumer Market Behavior and Pricing
