Revisiting Vision-Language Features Adaptation and Inconsistency for Social Media Popularity Prediction
Chih-Chung Hsu, Chia-Ming Lee, Yu-Fan Lin, Yi-Shiuan Chou, Chih-Yu, Jian, Chi-Han Tsai

TL;DR
This paper investigates the limitations of pre-trained vision-language models like CLIP in social media popularity prediction, highlighting semantic inconsistencies between images and text that affect model performance and proposing adaptations to improve accuracy.
Contribution
It uncovers the semantic inconsistency issue in CLIP features for SMP prediction and demonstrates how adapting features and measuring inconsistency enhances prediction accuracy.
Findings
Semantic inconsistency increases with post popularity.
Incorporating inconsistency measures improves model performance.
Achieved SRC of 0.729 and MAE of 1.227 in experiments.
Abstract
Social media popularity (SMP) prediction is a complex task involving multi-modal data integration. While pre-trained vision-language models (VLMs) like CLIP have been widely adopted for this task, their effectiveness in capturing the unique characteristics of social media content remains unexplored. This paper critically examines the applicability of CLIP-based features in SMP prediction, focusing on the overlooked phenomenon of semantic inconsistency between images and text in social media posts. Through extensive analysis, we demonstrate that this inconsistency increases with post popularity, challenging the conventional use of VLM features. We provide a comprehensive investigation of semantic inconsistency across different popularity intervals and analyze the impact of VLM feature adaptation on SMP tasks. Our experiments reveal that incorporating inconsistency measures and adapted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques · Misinformation and Its Impacts
