TL;DR
This paper introduces CL4CVR, a contrastive learning framework that enhances conversion rate prediction by leveraging unlabeled data and addressing data sparsity through novel embedding masking, false negative elimination, and positive inclusion techniques.
Contribution
The paper proposes a novel contrastive learning framework for CVR prediction that effectively utilizes unlabeled data and tackles data sparsity with new embedding masking, false negative elimination, and positive inclusion methods.
Findings
CL4CVR outperforms baseline models on real-world datasets.
Embedding masking improves data augmentation for CVR prediction.
False negative elimination enhances contrastive learning effectiveness.
Abstract
Conversion rate (CVR) prediction plays an important role in advertising systems. Recently, supervised deep neural network-based models have shown promising performance in CVR prediction. However, they are data hungry and require an enormous amount of training data. In online advertising systems, although there are millions to billions of ads, users tend to click only a small set of them and to convert on an even smaller set. This data sparsity issue restricts the power of these deep models. In this paper, we propose the Contrastive Learning for CVR prediction (CL4CVR) framework. It associates the supervised CVR prediction task with a contrastive learning task, which can learn better data representations exploiting abundant unlabeled data and improve the CVR prediction performance. To tailor the contrastive learning task to the CVR prediction problem, we propose embedding masking (EM),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
