Climber: Toward Efficient Scaling Laws for Large Recommendation Models

Songpei Xu; Shijia Wang; Da Guo; Xianwen Guo; Qiang Xiao; Bin Huang; Guanlin Wu; Chuanjiang Luo

arXiv:2502.09888·cs.IR·August 29, 2025

Climber: Toward Efficient Scaling Laws for Large Recommendation Models

Songpei Xu, Shijia Wang, Da Guo, Xianwen Guo, Qiang Xiao, Bin Huang, Guanlin Wu, Chuanjiang Luo

PDF

Open Access

TL;DR

Climber introduces a scalable, efficient recommendation framework with novel architecture and acceleration techniques, enabling better model scaling and online performance without high resource costs, validated through extensive experiments and deployment.

Contribution

The paper presents a new recommendation model architecture with multi-scale sequence extraction and dynamic attention modulation, along with acceleration methods, to improve scaling and efficiency.

Findings

01

Achieves 5.15× throughput gain without performance loss.

02

Demonstrates a 12.19% online metric lift in real deployment.

03

Validates superior scaling behavior across multiple datasets.

Abstract

Transformer-based generative models have achieved remarkable success across domains with various scaling law manifestations. However, our extensive experiments reveal persistent challenges when applying Transformer to recommendation systems: (1) Transformer scaling is not ideal with increased computational resources, due to structural incompatibilities with recommendation-specific features such as multi-source data heterogeneity; (2) critical online inference latency constraints (tens of milliseconds) that intensify with longer user behavior sequences and growing computational demands. We propose Climber, an efficient recommendation framework comprising two synergistic components: the model architecture for efficient scaling and the co-designed acceleration techniques. Our proposed model adopts two core innovations: (1) multi-scale sequence extraction that achieves a time complexity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Softmax