Category-Specific CNN for Visual-aware CTR Prediction at JD.com
Hu Liu, Jing Lu, Hao Yang, Xiwei Zhao, Sulong Xu, Hao Peng, Zehua, Zhang, Wenjie Niu, Xiaokun Zhu, Yongjun Bao, Weipeng Yan

TL;DR
This paper introduces a category-specific CNN model that integrates category information early in the network to improve visual-aware CTR prediction in e-commerce, demonstrating superior performance in both offline and online tests.
Contribution
The paper proposes CSCNN, a novel CNN architecture that incorporates category priors via an attention module for more effective visual feature extraction in CTR prediction.
Findings
CSCNN outperforms state-of-the-art algorithms in offline benchmarks.
CSCNN achieves significant improvements in online A/B testing.
The model effectively captures category-specific visual patterns.
Abstract
As one of the largest B2C e-commerce platforms in China, JD com also powers a leading advertising system, serving millions of advertisers with fingertip connection to hundreds of millions of customers. In our system, as well as most e-commerce scenarios, ads are displayed with images.This makes visual-aware Click Through Rate (CTR) prediction of crucial importance to both business effectiveness and user experience. Existing algorithms usually extract visual features using off-the-shelf Convolutional Neural Networks (CNNs) and late fuse the visual and non-visual features for the finally predicted CTR. Despite being extensively studied, this field still face two key challenges. First, although encouraging progress has been made in offline studies, applying CNNs in real systems remains non-trivial, due to the strict requirements for efficient end-to-end training and low-latency online…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
