MapIQ: Evaluating Multimodal Large Language Models for Map Question Answering

Varun Srivastava; Fan Lei; Srija Mukhopadhyay; Vivek Gupta; Ross Maciejewski

arXiv:2507.11625·cs.CL·October 7, 2025

MapIQ: Evaluating Multimodal Large Language Models for Map Question Answering

Varun Srivastava, Fan Lei, Srija Mukhopadhyay, Vivek Gupta, Ross Maciejewski

PDF

Open Access

TL;DR

This paper introduces MapIQ, a comprehensive benchmark dataset for evaluating multimodal large language models on map question answering across various map types and themes, highlighting model robustness and sensitivity to design changes.

Contribution

The paper presents MapIQ, a new large-scale dataset and evaluation framework for assessing MLLMs on diverse map visual question answering tasks and map design variations.

Findings

01

MLLMs show varying performance across map types and themes.

02

Map design changes significantly affect MLLM responses.

03

Humans outperform MLLMs in map question answering tasks.

Abstract

Recent advancements in multimodal large language models (MLLMs) have driven researchers to explore how well these models read data visualizations, e.g., bar charts, scatter plots. More recently, attention has shifted to visual question answering with maps (Map-VQA). However, Map-VQA research has primarily focused on choropleth maps, which cover only a limited range of thematic categories and visual analytical tasks. To address these gaps, we introduce MapIQ, a benchmark dataset comprising 14,706 question-answer pairs across three map types: choropleth maps, cartograms, and proportional symbol maps spanning topics from six distinct themes (e.g., housing, crime). We evaluate multiple MLLMs using six visual analytical tasks, comparing their performance against one another and a human baseline. An additional experiment examining the impact of map design changes (e.g., altered color schemes,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeographic Information Systems Studies · Topic Modeling · Natural Language Processing Techniques