An Efficient Attention Mechanism for Sequential Recommendation Tasks: HydraRec
Uzma Mushtaque

TL;DR
HydraRec introduces an efficient linear attention-based transformer model for sequential recommendation tasks, significantly reducing computational complexity while maintaining or improving predictive performance compared to existing models.
Contribution
The paper presents HydraRec, a novel linear attention transformer model tailored for sequential recommendation, offering improved efficiency and comparable or better accuracy.
Findings
HydraRec outperforms other linear attention models in accuracy.
HydraRec achieves faster training and inference times.
Performance is comparable to BERT4Rec with reduced computational cost.
Abstract
Transformer based models are increasingly being used in various domains including recommender systems (RS). Pretrained transformer models such as BERT have shown good performance at language modelling. With the greater ability to model sequential tasks, variants of Encoder-only models (like BERT4Rec, SASRec etc.) have found success in sequential RS problems. Computing dot-product attention in traditional transformer models has quadratic complexity in sequence length. This is a bigger problem with RS because unlike language models, new items are added to the catalogue every day. User buying history is a dynamic sequence which depends on multiple factors. Recently, various linear attention models have tried to solve this problem by making the model linear in sequence length (token dimensions). Hydra attention is one such linear complexity model proposed for vision transformers which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Attention Dropout · Absolute Position Encodings · Linear Layer · Softmax · Dense Connections · Linear Warmup With Linear Decay · Dropout
