HyFormer: Revisiting the Roles of Sequence Modeling and Feature Interaction in CTR Prediction
Yunwen Huang, Shiyong Hong, Xijun Xiao, Jinqiu Jin, Xuanyuan Luo, Zhe Wang, Zheng Chai, Shikang Wu, Yuchao Zheng, Jingjian Lin

TL;DR
HyFormer introduces a unified transformer-based architecture that jointly models long user behavior sequences and heterogeneous features, significantly improving CTR prediction performance in large-scale industrial recommendation systems.
Contribution
It presents a novel integrated framework that combines sequence modeling and feature interaction into a single backbone, enhancing representation capacity and interaction flexibility.
Findings
Outperforms existing baselines like LONGER and RankMixer in industrial datasets.
Demonstrates superior scalability with increasing parameters and FLOPs.
Achieves significant online A/B test improvements over current production models.
Abstract
Industrial large-scale recommendation models (LRMs) face the challenge of jointly modeling long-range user behavior sequences and heterogeneous non-sequential features under strict efficiency constraints. However, most existing architectures employ a decoupled pipeline: long sequences are first compressed with a query-token based sequence compressor like LONGER, followed by fusion with dense features through token-mixing modules like RankMixer, which thereby limits both the representation capacity and the interaction flexibility. This paper presents HyFormer, a unified hybrid transformer architecture that tightly integrates long-sequence modeling and feature interaction into a single backbone. From the perspective of sequence modeling, we revisit and redesign query tokens in LRMs, and frame the LRM modeling task as an alternating optimization process that integrates two core components:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Machine Learning in Healthcare · Multimodal Machine Learning Applications
