AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Zixin Chen; Hongzhan Lin; Kaixin Li; Ziyang Luo; Zhen Ye; Guang Chen; Zhiyong Huang; Jing Ma

arXiv:2507.01702·cs.CL·July 3, 2025

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Zixin Chen, Hongzhan Lin, Kaixin Li, Ziyang Luo, Zhen Ye, Guang Chen, Zhiyong Huang, Jing Ma

PDF

Open Access 1 Video

TL;DR

AdamMeme introduces an adaptive, multi-agent evaluation framework for assessing multimodal large language models' ability to understand meme harmfulness, overcoming static dataset limitations and revealing model weaknesses.

Contribution

It presents a novel, flexible evaluation method that adaptively probes mLLMs' reasoning on harmful memes through multi-agent collaboration and iterative data updating.

Findings

01

Systematically reveals performance variations among mLLMs

02

Provides in-depth analysis of model-specific weaknesses

03

Offers a dynamic, up-to-date assessment framework

Abstract

The proliferation of multimodal memes in the social media era demands that multimodal Large Language Models (mLLMs) effectively understand meme harmfulness. Existing benchmarks for assessing mLLMs on harmful meme understanding rely on accuracy-based, model-agnostic evaluations using static datasets. These benchmarks are limited in their ability to provide up-to-date and thorough assessments, as online memes evolve dynamically. To address this, we propose AdamMeme, a flexible, agent-based evaluation framework that adaptively probes the reasoning capabilities of mLLMs in deciphering meme harmfulness. Through multi-agent collaboration, AdamMeme provides comprehensive evaluations by iteratively updating the meme data with challenging samples, thereby exposing specific limitations in how mLLMs interpret harmfulness. Extensive experiments show that our framework systematically reveals the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness· underline

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts · Topic Modeling