Clustering Filipino Disaster-Related Tweets Using Incremental and Density-Based Spatiotemporal Algorithm with Support Vector Machines for Needs Assessment
Ocean M. Barba, Franz Arvin T. Calbay, Angelica Jane S. Francisco,, Angel Luis D. Santos, Charmaine S. Ponay

TL;DR
This study develops a multi-step machine learning approach to analyze Filipino disaster-related tweets, clustering them by disaster type and location, and classifying expressed needs with high accuracy to aid disaster response efforts.
Contribution
It introduces an integrated method combining incremental and density-based clustering with SVM classification for disaster-related social media analysis.
Findings
Clustering algorithms achieved F-measure scores of 47.20% and 82.28%.
Naive Bayes classifier achieved 97% F-measure.
SVM classifier achieved 77.57% accuracy.
Abstract
Social media has played a huge part on how people get informed and communicate with one another. It has helped people express their needs due to distress especially during disasters. Because posts made through it are publicly accessible by default, Twitter is among the most helpful social media sites in times of disaster. With this, the study aims to assess the needs expressed during calamities by Filipinos on Twitter. Data were gathered and classified as either disaster-related or unrelated with the use of Na\"ive Bayes classifier. After this, the disaster-related tweets were clustered per disaster type using Incremental Clustering Algorithm, and then sub-clustered based on the location and time of the tweet using Density-based Spatiotemporal Clustering Algorithm. Lastly, using Support Vector Machines, the tweets were classified according to the expressed need, such as shelter, rescue,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPublic Relations and Crisis Communication · Complex Network Analysis Techniques · Sentiment Analysis and Opinion Mining
