QuakeBERT: Accurate Classification of Social Media Texts for Rapid   Earthquake Impact Assessment

Jin Han; Zhe Zheng; Xin-Zheng Lu; Ke-Yin Chen; Jia-Rui Lin

arXiv:2405.06684·cs.CL·June 3, 2024

QuakeBERT: Accurate Classification of Social Media Texts for Rapid Earthquake Impact Assessment

Jin Han, Zhe Zheng, Xin-Zheng Lu, Ke-Yin Chen, Jia-Rui Lin

PDF

TL;DR

QuakeBERT is a domain-specific large language model designed to classify and filter social media texts rapidly, improving earthquake impact assessment accuracy and aiding disaster response efforts.

Contribution

This paper introduces the first domain-specific LLM for earthquake-related social media classification and an integrated method for impact assessment combining opinion, sentiment, and keyword analysis.

Findings

01

QuakeBERT improves macro F1 score by 27% with increased data diversity and volume.

02

QuakeBERT outperforms CNN and RNN models, achieving an F1 score of 84.33%.

03

The approach effectively detects noisy microblogs, enhancing post-disaster response.

Abstract

Social media aids disaster response but suffers from noise, hindering accurate impact assessment and decision making for resilient cities, which few studies considered. To address the problem, this study proposes the first domain-specific LLM model and an integrated method for rapid earthquake impact assessment. First, a few categories are introduced to classify and filter microblogs considering their relationship to the physical and social impacts of earthquakes, and a dataset comprising 7282 earthquake-related microblogs from twenty earthquakes in different locations is developed as well. Then, with a systematic analysis of various influential factors, QuakeBERT, a domain-specific large language model (LLM), is developed and fine-tuned for accurate classification and filtering of microblogs. Meanwhile, an integrated method integrating public opinion trend analysis, sentiment analysis,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.