WeatherQA: Can Multimodal Language Models Reason about Severe Weather?
Chengqian Ma, Zhanxiang Hua, Alexandra Anderson-Frey, Vikram Iyer, Xin, Liu, Lianhui Qin

TL;DR
WeatherQA introduces a multimodal dataset for reasoning about severe weather, enabling models to analyze complex environmental data and forecast dangerous weather events more effectively.
Contribution
This work presents the first multimodal weather reasoning dataset, WeatherQA, and evaluates state-of-the-art vision-language models on challenging forecasting tasks.
Findings
Models show significant performance gap compared to human reasoning.
GPT4 outperforms other models but still lags behind experts.
Identifies key weaknesses in current multimodal weather understanding.
Abstract
Severe convective weather events, such as hail, tornadoes, and thunderstorms, often occur quickly yet cause significant damage, costing billions of dollars every year. This highlights the importance of forecasting severe weather threats hours in advance to better prepare meteorologists and residents in at-risk areas. Can modern large foundation models perform such forecasting? Existing weather benchmarks typically focus only on predicting time-series changes in certain weather parameters (e.g., temperature, moisture) with text-only features. In this work, we introduce WeatherQA, the first multimodal dataset designed for machines to reason about complex combinations of weather parameters (a.k.a., ingredients) and predict severe weather in real-world scenarios. The dataset includes over 8,000 (multi-images, text) pairs for diverse severe weather events. Each pair contains rich information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling · Climate Change Communication and Perception
MethodsFocus
