GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art
Yiming Lei, Chenkai Zhang, Zeming Liu, Haitao Leng, Shaoguo Liu, Tingting Gao, Qingjie Liu, Yunhong Wang

TL;DR
GODBench is a new benchmark for evaluating multimodal large language models' ability to create creative video comments, addressing current limitations in understanding and generating humor, satire, and emotional content.
Contribution
The paper introduces GODBench, a comprehensive multimodal benchmark, and Ripple of Thought, a reasoning framework to enhance creativity in video comment generation by MLLMs.
Findings
Existing MLLMs struggle with creative video comments.
Ripple of Thought improves creative reasoning in MLLMs.
GODBench enables systematic evaluation of multimodal creativity.
Abstract
Video Comment Art enhances user engagement by providing creative content that conveys humor, satire, or emotional resonance, requiring a nuanced and comprehensive grasp of cultural and contextual subtleties. Although Multimodal Large Language Models (MLLMs) and Chain-of-Thought (CoT) have demonstrated strong reasoning abilities in STEM tasks (e.g. mathematics and coding), they still struggle to generate creative expressions such as resonant jokes and insightful satire. Moreover, existing benchmarks are constrained by their limited modalities and insufficient categories, hindering the exploration of comprehensive creativity in video-based Comment Art creation. To address these limitations, we introduce GODBench, a novel benchmark that integrates video and text modalities to systematically evaluate MLLMs' abilities to compose Comment Art. Furthermore, inspired by the propagation patterns…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Artificial Intelligence in Games · Sentiment Analysis and Opinion Mining
