Online Density-Based Clustering for Real-Time Narrative Evolution Monitorin
Ostap Vykhopen, Viktoria Skorik, Maksym Tereshchenko, Veronika Solopova

TL;DR
This paper explores replacing batch clustering with online density-based algorithms like DenStream to improve real-time social media narrative monitoring, balancing scalability, cluster quality, and interpretability.
Contribution
It introduces an online clustering approach tailored for large-scale, multilingual social media data streams, enhancing real-time narrative analysis capabilities.
Findings
DenStream outperforms other online methods in overall performance.
Trade-offs exist between temporal stability and narrative coherence.
Online clustering enables scalable, real-time social media monitoring.
Abstract
Automated narrative intelligence systems for social media monitoring face significant scalability challenges when relying on batch clustering methods to process continuous data streams. We investigate replacing offline HDBSCAN with online density-based clustering algorithms in a production narrative report generation pipeline that processes large volumes of multilingual social media data. While HDBSCAN effectively discovers hierarchical clusters and handles noise, its batch-only nature requires full retraining for each time window, limiting scalability and real-time adaptability. We evaluate online clustering methods with respect to cluster quality, computational efficiency, memory footprint, and integration with downstream narrative extraction. Our evaluation combines standard clustering metrics, narrative-specific measures, and human validation of cluster correctness to assess both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Expert finding and Q&A systems · Advanced Clustering Algorithms Research
