HyperFusion: Hierarchical Multimodal Ensemble Learning for Social Media Popularity Prediction
Liliang Ye (1), Yunyao Zhang (1), Yafeng Wu (1), Yi-Ping Phoebe Chen (2), Junqing Yu (1), Wei Yang (1), Zikai Song (1) ((1) Huazhong University of Science, Technology, Wuhan, China, (2) La Trobe University, Melbourne, Australia)

TL;DR
HyperFusion is a hierarchical ensemble framework that integrates visual, textual, and metadata features to improve social media popularity prediction, demonstrating competitive results and winning third place in a challenge.
Contribution
It introduces a novel three-tier fusion architecture with hierarchical ensemble strategies and a two-stage training process for social media popularity prediction.
Findings
Achieved third place in SMP Challenge 2025 (Image Track)
Demonstrated competitive performance on the SMP dataset
Proposed novel cross-modal similarity measures and clustering features
Abstract
Social media popularity prediction plays a crucial role in content optimization, marketing strategies, and user engagement enhancement across digital platforms. However, predicting post popularity remains challenging due to the complex interplay between visual, textual, temporal, and user behavioral factors. This paper presents HyperFusion, a hierarchical multimodal ensemble learning framework for social media popularity prediction. Our approach employs a three-tier fusion architecture that progressively integrates features across abstraction levels: visual representations from CLIP encoders, textual embeddings from transformer models, and temporal-spatial metadata with user characteristics. The framework implements a hierarchical ensemble strategy combining CatBoost, TabNet, and custom multi-layer perceptrons. To address limited labeled data, we propose a two-stage training methodology…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Complex Network Analysis Techniques · Mental Health via Writing
