Human-Centered Multimodal Fusion for Sexism Detection in Memes with Eye-Tracking, Heart Rate, and EEG Signals
Iv\'an Arcos, Paolo Rosso, Elena Gomis-Vicent

TL;DR
This paper introduces a human-centered multimodal approach combining physiological data and content features to improve sexism detection in memes, achieving significant accuracy gains and better handling nuanced cases.
Contribution
It presents a novel fusion model that integrates eye-tracking, heart rate, EEG, and visual-language features, demonstrating enhanced detection of subtle and complex sexist content.
Findings
Physiological signals differ significantly between sexist and non-sexist meme processing.
The multimodal fusion model improves AUC by 3.4% over baseline.
Enhanced detection accuracy for nuanced categories like Misogyny.
Abstract
The automated detection of sexism in memes is a challenging task due to multimodal ambiguity, cultural nuance, and the use of humor to provide plausible deniability. Content-only models often fail to capture the complexity of human perception. To address this limitation, we introduce and validate a human-centered paradigm that augments standard content features with physiological data. We created a novel resource by recording Eye-Tracking (ET), Heart Rate (HR), and Electroencephalography (EEG) from 16 subjects (8 per experiment) while they viewed 3984 memes from the EXIST 2025 dataset. Our statistical analysis reveals significant physiological differences in how subjects process sexist versus non-sexist content. Sexist memes were associated with higher cognitive load, reflected in increased fixation counts and longer reaction times, as well as differences in EEG spectral power across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHumor Studies and Applications · Hate Speech and Cyberbullying Detection · Emotion and Mood Recognition
