SARM: LLM-Augmented Semantic Anchor for End-to-End Live-Streaming Ranking

Ruochen Yang; Yueyang Liu; Zijie Zhuang; Changxin Lao; Yuhui Zhang; Jiangxia Cao; Jia Xu; Xiang Chen; Haoke Xiao; Xiangyu Wu; Xiaoyou Zhou; Xiao Lv; Shuang Yang; Tingwen Liu; Zhaojie Liu; Han Li; Kun Gai

arXiv:2602.09401·cs.IR·February 11, 2026

SARM: LLM-Augmented Semantic Anchor for End-to-End Live-Streaming Ranking

Ruochen Yang, Yueyang Liu, Zijie Zhuang, Changxin Lao, Yuhui Zhang, Jiangxia Cao, Jia Xu, Xiang Chen, Haoke Xiao, Xiangyu Wu, Xiaoyou Zhou, Xiao Lv, Shuang Yang, Tingwen Liu, Zhaojie Liu, Han Li, Kun Gai

PDF

Open Access

TL;DR

SARM introduces an end-to-end ranking model that integrates natural-language semantic anchors with multimodal content for improved live-streaming recommendation accuracy, addressing limitations of previous methods.

Contribution

The paper presents SARM, a novel architecture that jointly optimizes semantic anchors with ranking features, enabling fine-grained, content-aware recommendations in real-time.

Findings

01

Consistent offline performance improvements.

02

Successful large-scale A/B testing results.

03

Deployed at scale serving over 400 million users daily.

Abstract

Large-scale live-streaming recommendation requires precise modeling of non-stationary content semantics under strict real-time serving constraints. In industrial deployment, two common approaches exhibit fundamental limitations: discrete semantic abstractions sacrifice descriptive precision through clustering, while dense multimodal embeddings are extracted independently and remain weakly aligned with ranking optimization, limiting fine-grained content-aware ranking. To address these limitations, we propose \textbf{SARM}, an end-to-end ranking architecture that integrates natural-language semantic anchors directly into ranking optimization, enabling fine-grained author representations conditioned on multimodal content. Each semantic anchor is represented as learnable text tokens jointly optimized with ranking features, allowing the model to adapt content descriptions to ranking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Video Analysis and Summarization · Multimodal Machine Learning Applications