Multi-Epoch learning with Data Augmentation for Deep Click-Through Rate Prediction
Zhongxiang Fan, Zhaocheng Liu, Jian Liang, Dongying Kong, Han Li, Peng, Jiang, Shuang Li, Kun Gai

TL;DR
This paper introduces MEDA, a novel multi-epoch training framework with data augmentation for deep CTR models, addressing overfitting and improving performance over traditional single-epoch training.
Contribution
MEDA is the first multi-epoch training strategy for deep CTR prediction, reducing overfitting and enhancing model performance through embedding space augmentation and MLP adaptability.
Findings
MEDA outperforms single-epoch training on multiple datasets.
Pre-trained MLP layers adapt to new embedding spaces effectively.
MEDA benefits real-world online advertising systems.
Abstract
This paper investigates the one-epoch overfitting phenomenon in Click-Through Rate (CTR) models, where performance notably declines at the start of the second epoch. Despite extensive research, the efficacy of multi-epoch training over the conventional one-epoch approach remains unclear. We identify the overfitting of the embedding layer, caused by high-dimensional data sparsity, as the primary issue. To address this, we introduce a novel and simple Multi-Epoch learning with Data Augmentation (MEDA) framework, suitable for both non-continual and continual learning scenarios, which can be seamlessly integrated into existing deep CTR models and may have potential applications to handle the "forgetting or overfitting" dilemma in the retraining and the well-known catastrophic forgetting problems. MEDA minimizes overfitting by reducing the dependency of the embedding layer on subsequent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment
