MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts

Zinan Zeng; Sen Ye; Zijian Cai; Heng Wang; Yuhan Liu; Haokai Zhang; Minnan Luo

arXiv:2403.05265·cs.AI·September 8, 2025·1 cites

MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts

Zinan Zeng, Sen Ye, Zijian Cai, Heng Wang, Yuhan Liu, Haokai Zhang, Minnan Luo

PDF

Open Access

TL;DR

This paper introduces MMoE, a multi-modal neural network that leverages heterogeneous data sources and a mixture-of-experts architecture to improve the robustness and domain generalization of spoiler detection in online movie reviews.

Contribution

The paper proposes MMoE, a novel multi-modal and domain-aware model that integrates graph, text, and metadata features for more effective spoiler detection across genres.

Findings

01

Achieves state-of-the-art accuracy and F1-score on two datasets.

02

Outperforms previous methods by 2.56% and 8.41%.

03

Demonstrates superior robustness and generalization capabilities.

Abstract

Online movie review websites are valuable for information and discussion about movies. However, the massive spoiler reviews detract from the movie-watching experience, making spoiler detection an important task. Previous methods simply focus on reviews' text content, ignoring the heterogeneity of information in the platform. For instance, the metadata and the corresponding user's information of a review could be helpful. Besides, the spoiler language of movie reviews tends to be genre-specific, thus posing a domain generalization challenge for existing methods. To this end, we propose MMoE, a multi-modal network that utilizes information from multiple modalities to facilitate robust spoiler detection and adopts Mixture-of-Experts to enhance domain generalization. MMoE first extracts graph, text, and meta feature from the user-movie network, the review's textual content, and the review's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing

MethodsFocus