FirstAidQA: A Synthetic Dataset for First Aid and Emergency Response in Low-Connectivity Settings
Saiyma Sittul Muna, Rezwan Islam Salvi, Mushfiqur Rahman Mushfique, Ajwad Abrar

TL;DR
FirstAidQA is a synthetic dataset of 5,500 high-quality question-answer pairs created using ChatGPT-4o-mini, aimed at enabling lightweight, offline-capable AI models for first aid and emergency response in low-connectivity environments.
Contribution
The paper introduces FirstAidQA, a novel high-quality synthetic dataset specifically designed for training AI models in emergency and first aid scenarios, addressing a critical data gap.
Findings
Dataset supports instruction-tuning of LLMs and SLMs.
Enables development of offline, resource-efficient emergency response AI.
Publicly available for research and development.
Abstract
In emergency situations, every second counts. The deployment of Large Language Models (LLMs) in time-sensitive, low or zero-connectivity environments remains limited. Current models are computationally intensive and unsuitable for low-tier devices often used by first responders or civilians. A major barrier to developing lightweight, domain-specific solutions is the lack of high-quality datasets tailored to first aid and emergency response. To address this gap, we introduce FirstAidQA, a synthetic dataset containing 5,500 high-quality question answer pairs that encompass a wide range of first aid and emergency response scenarios. The dataset was generated using a Large Language Model, ChatGPT-4o-mini, with prompt-based in-context learning, using texts from the Vital First Aid Book (2019). We applied preprocessing steps such as text cleaning, contextual chunking, and filtering, followed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Cardiac Arrest and Resuscitation
