Transformers Meet ACT-R: Repeat-Aware and Sequential Listening Session Recommendation
Viet-Anh Tran, Guillaume Salha-Galvan, Bruno Sguerra, Romain Hennequin

TL;DR
This paper introduces PISA, a novel Transformer-based recommender system inspired by ACT-R, which effectively models repetitive listening behaviors in music sessions, improving prediction accuracy.
Contribution
PISA uniquely integrates ACT-R cognitive architecture with Transformer models to capture dynamic and repetitive user listening patterns in session-based music recommendation.
Findings
PISA outperforms existing methods on Last.fm and Deezer datasets.
Repetition modeling significantly improves recommendation accuracy.
Public release of dataset and source code to support future research.
Abstract
Music streaming services often leverage sequential recommender systems to predict the best music to showcase to users based on past sequences of listening sessions. Nonetheless, most sequential recommendation methods ignore or insufficiently account for repetitive behaviors. This is a crucial limitation for music recommendation, as repeatedly listening to the same song over time is a common phenomenon that can even change the way users perceive this song. In this paper, we introduce PISA (Psychology-Informed Session embedding using ACT-R), a session-level sequential recommender system that overcomes this limitation. PISA employs a Transformer architecture learning embedding representations of listening sessions and users using attention mechanisms inspired by Anderson's ACT-R (Adaptive Control of Thought-Rational), a cognitive architecture modeling human information access and memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
Methodstravel james · Linear Layer · Adam · Layer Normalization · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dense Connections · Residual Connection · Multi-Head Attention · Byte Pair Encoding
