CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts
Rabindra Lamsal, Maria Rodriguez Read, Shanika Karunasekera

TL;DR
CrisisTransformers introduces specialized pre-trained language models and sentence encoders trained on crisis-related social media data, significantly improving NLP task performance for crisis informatics applications.
Contribution
The paper presents CrisisTransformers, a new ensemble of models trained on extensive crisis data, outperforming existing models in classification and sentence encoding tasks.
Findings
Models outperform baselines across all datasets.
Sentence encoder improves state-of-the-art by 17.43%.
Domain-specific models enhance semantic sentence embeddings.
Abstract
Social media platforms play an essential role in crisis communication, but analyzing crisis-related social media texts is challenging due to their informal nature. Transformer-based pre-trained models like BERT and RoBERTa have shown success in various NLP tasks, but they are not tailored for crisis-related texts. Furthermore, general-purpose sentence encoders are used to generate sentence embeddings, regardless of the textual complexities in crisis-related texts. Advances in applications like text classification, semantic search, and clustering contribute to the effective processing of crisis-related texts, which is essential for emergency responders to gain a comprehensive view of a crisis event, whether historical or real-time. To address these gaps in crisis informatics literature, this study introduces CrisisTransformers, an ensemble of pre-trained language models and sentence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPublic Relations and Crisis Communication · Disaster Management and Resilience
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Layer Normalization · Linear Layer · Dense Connections · Attention Dropout · Residual Connection · Adam · Weight Decay
