MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts
Zinan Zeng, Sen Ye, Zijian Cai, Heng Wang, Yuhan Liu, Haokai Zhang, Minnan Luo

TL;DR
This paper introduces MMoE, a multi-modal neural network that leverages heterogeneous data sources and a mixture-of-experts architecture to improve the robustness and domain generalization of spoiler detection in online movie reviews.
Contribution
The paper proposes MMoE, a novel multi-modal and domain-aware model that integrates graph, text, and metadata features for more effective spoiler detection across genres.
Findings
Achieves state-of-the-art accuracy and F1-score on two datasets.
Outperforms previous methods by 2.56% and 8.41%.
Demonstrates superior robustness and generalization capabilities.
Abstract
Online movie review websites are valuable for information and discussion about movies. However, the massive spoiler reviews detract from the movie-watching experience, making spoiler detection an important task. Previous methods simply focus on reviews' text content, ignoring the heterogeneity of information in the platform. For instance, the metadata and the corresponding user's information of a review could be helpful. Besides, the spoiler language of movie reviews tends to be genre-specific, thus posing a domain generalization challenge for existing methods. To this end, we propose MMoE, a multi-modal network that utilizes information from multiple modalities to facilitate robust spoiler detection and adopts Mixture-of-Experts to enhance domain generalization. MMoE first extracts graph, text, and meta feature from the user-movie network, the review's textual content, and the review's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing
MethodsFocus
