TL;DR
This paper introduces LATTICE, a novel method that mines latent item-item structures from multimodal content to improve multimedia recommendation accuracy by explicitly modeling high-order item relationships.
Contribution
The paper proposes a new structure learning layer that captures latent item-item relationships across multiple modalities, enhancing recommendation performance.
Findings
LATTICE outperforms state-of-the-art methods on three real-world datasets.
Latent item-item structures significantly boost recommendation accuracy.
Explicit modeling of high-order item affinities improves item representation quality.
Abstract
Multimedia content is of predominance in the modern Web era. Investigating how users interact with multimodal items is a continuing concern within the rapid development of recommender systems. The majority of previous work focuses on modeling user-item interactions with multimodal features included as side information. However, this scheme is not well-designed for multimedia recommendation. Specifically, only collaborative item-item relationships are implicitly modeled through high-order item-user-item relations. Considering that items are associated with rich contents in multiple modalities, we argue that the latent semantic item-item structures underlying these multimodal contents could be beneficial for learning better item representations and further boosting recommendation. To this end, we propose a LATent sTructure mining method for multImodal reCommEndation, which we term LATTICE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
