Multi-Modal Recommendation System with Auxiliary Information
Mufhumudzi Muthivhi, Terence L. van Zyl, Hairong Wang

TL;DR
This paper enhances context-aware recommendation systems by integrating multi-modal auxiliary information, leading to significant improvements in prediction accuracy across different user sequence datasets.
Contribution
It introduces a method to incorporate comprehensive auxiliary knowledge using embeddings and transformer architectures, advancing beyond sequential context modeling.
Findings
4% increase in NDCG for long sequences
11% increase in NDCG for short sequences
Statistically significant accuracy improvements
Abstract
Context-aware recommendation systems improve upon classical recommender systems by including, in the modelling, a user's behaviour. Research into context-aware recommendation systems has previously only considered the sequential ordering of items as contextual information. However, there is a wealth of unexploited additional multi-modal information available in auxiliary knowledge related to items. This study extends the existing research by evaluating a multi-modal recommendation system that exploits the inclusion of comprehensive auxiliary knowledge related to an item. The empirical results explore extracting vector representations (embeddings) from unstructured and structured data using data2vec. The fused embeddings are then used to train several state-of-the-art transformer architectures for sequential user-item representations. The analysis of the experimental results shows a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Human Mobility and Location-Based Analysis · Image Retrieval and Classification Techniques
