MMBAttn: Max-Mean and Bit-wise Attention for CTR Prediction
Hasan Saribas, Cagri Yesil, Serdarcan Dilbaz, Halit Orenbas

TL;DR
This paper introduces MMBAttn, an attention mechanism combining max-mean pooling and bit-wise attention to improve feature importance estimation in CTR prediction, achieving state-of-the-art results.
Contribution
It presents a novel attention architecture that leverages bit-wise interactions along with max and mean pooling for better feature importance estimation in CTR models.
Findings
Significantly improves CTR prediction accuracy
Achieves state-of-the-art results on public datasets
Enhances feature importance estimation with bit-level attention
Abstract
With the increasing complexity and scale of click-through rate (CTR) prediction tasks in online advertising and recommendation systems, accurately estimating the importance of features has become a critical aspect of developing effective models. In this paper, we propose an attention-based approach that leverages max and mean pooling operations, along with a bit-wise attention mechanism, to enhance feature importance estimation in CTR prediction. Traditionally, pooling operations such as max and mean pooling have been widely used to extract relevant information from features. However, these operations can lead to information loss and hinder the accurate determination of feature importance. To address this challenge, we propose a novel attention architecture that utilizes a bit-based attention structure that emphasizes the relationships between all bits in features, together with maximum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Sentiment Analysis and Opinion Mining · Web Data Mining and Analysis
MethodsBalanced Selection
