DAS: Dual-Aligned Semantic IDs Empowered Industrial Recommender System

Wencai Ye; Mingjie Sun; Shaoyun Shi; Peng Wang; Wenjin Wu; Peng Jiang

arXiv:2508.10584·cs.IR·November 25, 2025

DAS: Dual-Aligned Semantic IDs Empowered Industrial Recommender System

Wencai Ye, Mingjie Sun, Shaoyun Shi, Peng Wang, Wenjin Wu, Peng Jiang

PDF

TL;DR

This paper introduces DAS, a one-stage dual-alignment method for semantic IDs in industrial recommender systems, improving semantic integrity and alignment efficiency over traditional two-stage approaches.

Contribution

DAS is a novel, flexible one-stage framework that jointly optimizes quantization and alignment, enhancing mutual information and reducing information loss in semantic ID generation.

Findings

01

DAS outperforms previous methods in offline and online evaluations.

02

Successfully deployed at Kuaishou serving over 400 million users daily.

03

Achieves better semantic and collaborative signal alignment.

Abstract

Semantic IDs are discrete identifiers generated by quantizing the Multi-modal Large Language Models (MLLMs) embeddings, enabling efficient multi-modal content integration in recommendation systems. However, their lack of collaborative signals results in a misalignment with downstream discriminative and generative recommendation objectives. Recent studies have introduced various alignment mechanisms to address this problem, but their two-stage framework design still leads to two main limitations: (1) inevitable information loss during alignment, and (2) inflexibility in applying adaptive alignment strategies, consequently constraining the mutual information maximization during the alignment process. To address these limitations, we propose a novel and flexible one-stage Dual-Aligned Semantic IDs (DAS) method that simultaneously optimizes quantization and alignment, preserving semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.