MealRec: Multi-granularity Sequential Modeling via Hierarchical Diffusion Models for Micro-Video Recommendation
Xinxin Dong, Haokai Ma, Yuze Zheng, Yongfu Zha, Yonghui Yang, Xiaodong Wang

TL;DR
MealRec introduces a hierarchical diffusion model that captures multi-granularity sequential preferences in micro-video recommendation, effectively addressing noise and modality conflicts for improved personalization.
Contribution
The paper proposes a novel hierarchical diffusion framework with TCD and NPD modules to enhance micro-video recommendation by modeling temporal and semantic preferences more accurately.
Findings
Outperforms existing methods on four micro-video datasets.
Demonstrates robustness against noisy and conflicting multimodal content.
Shows universality across different platform datasets.
Abstract
Micro-video recommendation aims to capture user preferences from the collaborative and context information of the interacted micro-videos, thereby predicting the appropriate videos. This target is often hindered by the inherent noise within multimodal content and unreliable implicit feedback, which weakens the correspondence between behaviors and underlying interests. While conventional works have predominantly approached such scenario through behavior-augmented modeling and content-centric multimodal analysis, these paradigms can inadvertently give rise to two non-trivial challenges: preference-irrelative video representation extraction and inherent modality conflicts. To address these issues, we propose a Multi-granularity sequential modeling method via hierarchical diffusion models for micro-video Recommendation (MealRec), which simultaneously considers temporal correlations during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Emotion and Mood Recognition · Visual Attention and Saliency Detection
