M3GIA: A Cognition Inspired Multilingual and Multimodal General Intelligence Ability Benchmark
Wei Song, Yadong Li, Jianhua Xu, Guowei Wu, Lingfeng Ming, Kexin Yi,, Weihua Luo, Houyi Li, Yi Du, Fangda Guo, Kaicheng Yu

TL;DR
This paper introduces M3GIA, a novel multilingual and multimodal benchmark inspired by cognitive science to evaluate the general intelligence of large language models beyond superficial task performance.
Contribution
It proposes a cognitive-driven evaluation framework based on the CHC model, encompassing multiple languages and cultural contexts to assess MLLMs' cognitive abilities.
Findings
Advanced MLLMs reach the lower boundary of human intelligence in English.
Significant disparities in cognitive abilities across different languages.
Reveals a winner-takes-all phenomenon in cognitive performance.
Abstract
As recent multi-modality large language models (MLLMs) have shown formidable proficiency on various complex tasks, there has been increasing attention on debating whether these models could eventually mirror human intelligence. However, existing benchmarks mainly focus on evaluating solely on task performance, such as the accuracy of identifying the attribute of an object. Combining well-developed cognitive science to understand the intelligence of MLLMs beyond superficial achievements remains largely unexplored. To this end, we introduce the first cognitive-driven multi-lingual and multi-modal benchmark to evaluate the general intelligence ability of MLLMs, dubbed M3GIA. Specifically, we identify five key cognitive factors based on the well-recognized Cattell-Horn-Carrol (CHC) model of intelligence and propose a novel evaluation metric. In addition, since most MLLMs are trained to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Teaching and Learning Programming
MethodsFocus
