Exploring Scaling Laws of CTR Model for Online Performance Improvement

Weijiang Lai; Beihong Jin; Jiongyan Zhang; Yiyuan Zheng; Jian Dong; Jia Cheng; Jun Lei; Xingxing Wang

arXiv:2508.15326·cs.IR·September 22, 2025

Exploring Scaling Laws of CTR Model for Online Performance Improvement

Weijiang Lai, Beihong Jin, Jiongyan Zhang, Yiyuan Zheng, Jian Dong, Jia Cheng, Jun Lei, Xingxing Wang

PDF

TL;DR

This paper introduces a scalable CTR prediction model called SUAN, inspired by LLM scaling laws, and demonstrates how knowledge distillation can produce lightweight models that improve online performance without increasing inference time.

Contribution

The paper proposes a new paradigm for CTR modeling based on scaling laws, introduces the SUAN model with a novel behavior sequence encoder, and applies online distillation to create efficient lightweight models for deployment.

Findings

01

SUAN demonstrates performance scaling across three orders of magnitude.

02

Distilled LightSUAN outperforms higher-grade SUAN models.

03

Online deployment of LightSUAN increases CTR by 2.81%.

Abstract

CTR models play a vital role in improving user experience and boosting business revenue in many online personalized services. However, current CTR models generally encounter bottlenecks in performance improvement. Inspired by the scaling law phenomenon of LLMs, we propose a new paradigm for improving CTR predictions: first, constructing a CTR model with accuracy scalable to the model grade and data size, and then distilling the knowledge implied in this model into its lightweight model that can serve online users. To put it into practice, we construct a CTR model named SUAN (Stacked Unified Attention Network). In SUAN, we propose the UAB as a behavior sequence encoder. A single UAB unifies the modeling of the sequential and non-sequential features and also measures the importance of each user behavior feature from multiple perspectives. Stacked UABs elevate the configuration to a high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.