SML:Enhance the Network Smoothness with Skip Meta Logit for CTR Prediction
Wenlong Deng, Lang Lang, Zhen Liu, Bin Liu

TL;DR
This paper introduces Skip Meta Logit, a novel method that enhances network smoothness for CTR prediction by incorporating skip connections and normalization, leading to improved accuracy and stability in real-world applications.
Contribution
The paper proposes Skip Meta Logit with Meta Tanh Normalization, providing a new way to improve deep neural network training stability and performance for CTR prediction.
Findings
Achieved incremental performance boosts on state-of-the-art models.
Proved the absence of spurious local optima in deep skip logit networks.
Delivered offline and online metrics improvements in TikTok's ad system.
Abstract
In light of the smoothness property brought by skip connections in ResNet, this paper proposed the Skip Logit to introduce the skip connection mechanism that fits arbitrary DNN dimensions and embraces similar properties to ResNet. Meta Tanh Normalization (MTN) is designed to learn variance information and stabilize the training process. With these delicate designs, our Skip Meta Logit (SML) brought incremental boosts to the performance of extensive SOTA ctr prediction models on two real-world datasets. In the meantime, we prove that the optimization landscape of arbitrarily deep skip logit networks has no spurious local optima. Finally, SML can be easily added to building blocks and has delivered offline accuracy and online business metrics gains on app ads learning to rank systems at TikTok.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Traffic Prediction and Management Techniques · Data Stream Mining Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · 1x1 Convolution · Residual Connection · Kaiming Initialization · Max Pooling · Average Pooling · Global Average Pooling · Bottleneck Residual Block · Residual Block
