TL;DR
This paper revisits feature interactions in CTR prediction through quadratic neural networks, proposing novel neuron formats and enhancements that achieve state-of-the-art results efficiently.
Contribution
It introduces the QNN-alpha neuron format and the Multi-Head Khatri-Rao Product, advancing feature interaction modeling for CTR prediction.
Findings
QNN-alpha achieves state-of-the-art performance on six datasets.
Mid-activation outperforms post-activation in QNN.
Proposed methods maintain low latency and good scalability.
Abstract
Hadamard Product (HP) has long been a cornerstone in click-through rate (CTR) prediction tasks due to its simplicity, effectiveness, and ability to capture feature interactions without additional parameters. However, the underlying reasons for its effectiveness remain unclear. In this paper, we revisit HP from the perspective of Quadratic Neural Networks (QNN), which leverage quadratic interaction terms to model complex feature relationships. We further reveal QNN's ability to expand the feature space and provide smooth nonlinear approximations without relying on activation functions. Meanwhile, we find that traditional post-activation does not further improve the performance of the QNN. Instead, mid-activation is a more suitable alternative. Through theoretical analysis and empirical evaluation of 25 QNN neuron formats, we identify a good-performing variant and make further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
