Hierarchical Structured Neural Network: Efficient Retrieval Scaling for Large Scale Recommendation
Kaushik Rangadurai, Siyang Yuan, Minhui Huang, Yiqun Liu, Golnaz, Ghasemiesfeh, Yunchen Pu, Haiyu Lu, Xingfeng He, Fangzhou Xu, Andrew Cui,, Vidhoon Viswanathan, Lin Yang, Liang Wang, Jiyan Yang, Chonglin Sun

TL;DR
This paper introduces the Hierarchical Structured Neural Network (HSNN), a scalable and expressive retrieval model that improves large-scale recommendation systems by capturing complex interactions and adapting to distribution shifts.
Contribution
The paper proposes HSNN, a novel hierarchical neural network with a modular design that enhances interaction learning and scales efficiently for large item corpora.
Findings
HSNN outperforms existing methods in offline evaluations.
It achieves sublinear computational costs relative to corpus size.
HSNN adapts to distribution shifts in user interests and item distributions.
Abstract
Retrieval, the initial stage of a recommendation system, is tasked with down-selecting items from a pool of tens of millions of candidates to a few thousands. Embedding Based Retrieval (EBR) has been a typical choice for this problem, addressing the computational demands of deep neural networks across vast item corpora. EBR utilizes Two Tower or Siamese Networks to learn representations for users and items, and employ Approximate Nearest Neighbor (ANN) search to efficiently retrieve relevant items. Despite its popularity in industry, EBR faces limitations. The Two Tower architecture, relying on a single dot product interaction, struggles to capture complex data distributions due to limited capability in learning expressive interactions between users and items. Additionally, ANN index building and representation learning for user and item are often separate, leading to inconsistencies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
