DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster Management

Kai Yin; Xiangjue Dong; Chengkai Liu; Lipai Huang; Yiming Xiao; Zhewei Liu; Ali Mostafavi; James Caverlee

arXiv:2505.15856·cs.IR·September 23, 2025

DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster Management

Kai Yin, Xiangjue Dong, Chengkai Liu, Lipai Huang, Yiming Xiao, Zhewei Liu, Ali Mostafavi, James Caverlee

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

DisastIR is a new comprehensive benchmark designed specifically for evaluating information retrieval systems in disaster management, addressing the unique challenges and needs of this domain.

Contribution

It introduces the first large-scale, domain-specific IR benchmark for disaster management, with diverse queries and tasks to better evaluate model performance in this critical area.

Findings

01

Significant performance variation among state-of-the-art models.

02

No single model performs best across all disaster-related tasks.

03

Disaster-specific IR benchmarks reveal gaps in general-domain models.

Abstract

Effective disaster management requires timely access to accurate and contextually relevant information. Existing Information Retrieval (IR) benchmarks, however, focus primarily on general or specialized domains, such as medicine or finance, neglecting the unique linguistic complexity and diverse information needs encountered in disaster management scenarios. To bridge this gap, we introduce DisastIR, the first comprehensive IR evaluation benchmark specifically tailored for disaster management. DisastIR comprises 9,600 diverse user queries and more than 1.3 million labeled query-passage pairs, covering 48 distinct retrieval tasks derived from six search intents and eight general disaster categories that include 301 specific event types. Our evaluations of 30 state-of-the-art retrieval models demonstrate significant performance variances across tasks, with no single model excelling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kaiyin97/disaster_ir
noneOfficial

Datasets

DMIR01/DisastIR
dataset· 105 dl
105 dl

Videos

DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster Management· underline

Taxonomy

TopicsPublic Relations and Crisis Communication · Information Retrieval and Search Behavior · Expert finding and Q&A systems

MethodsFocus