Detecting Rumours with Latency Guarantees using Massive Streaming Data
Thanh Tam Nguyen, Thanh Trung Huynh, Hongzhi Yin, Matthias Weidlich,, Thanh Thi Nguyen, Thai Son Mai, Quoc Viet Hung Nguyen

TL;DR
This paper presents a novel approach for rapid rumour detection in massive social media data streams by combining graph-based pattern matching with load shedding, prioritizing quick detection over complete accuracy.
Contribution
It introduces a best-effort rumour detection method that balances speed and accuracy using load shedding and graph matching techniques, addressing latency challenges in high-volume data streams.
Findings
Robust detection performance under diverse streaming conditions
Efficient runtime performance on large-scale datasets
Maintains high detection accuracy despite data discarding
Abstract
Today's social networks continuously generate massive streams of data, which provide a valuable starting point for the detection of rumours as soon as they start to propagate. However, rumour detection faces tight latency bounds, which cannot be met by contemporary algorithms, given the sheer volume of high-velocity streaming data emitted by social networks. Hence, in this paper, we argue for best-effort rumour detection that detects most rumours quickly rather than all rumours with a high delay. To this end, we combine techniques for efficient, graph-based matching of rumour patterns with effective load shedding that discards some of the input data while minimising the loss in accuracy. Experiments with large-scale real-world datasets illustrate the robustness of our approach in terms of runtime performance and detection accuracy under diverse streaming conditions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Data Stream Mining Techniques · Network Security and Intrusion Detection
