Engineering Crowdsourced Stream Processing Systems
Muhammad Imran, Ioanna Lykourentzou, Yannick Naudet, Carlos Castillo

TL;DR
This paper introduces a framework for designing and evaluating crowdsourced stream processing systems, demonstrating improved accuracy and efficiency in classifying social media during crises.
Contribution
It presents a taxonomy, design principles, evaluation metrics, and patterns for CSP systems, along with a case study applying these to a real-world system.
Findings
AIDR achieves higher classification accuracy than pure stream processing.
AIDR reduces manual effort compared to pure crowdsourcing.
The framework effectively guides CSP system design and analysis.
Abstract
A crowdsourced stream processing system (CSP) is a system that incorporates crowdsourced tasks in the processing of a data stream. This can be seen as enabling crowdsourcing work to be applied on a sample of large-scale data at high speed, or equivalently, enabling stream processing to employ human intelligence. It also leads to a substantial expansion of the capabilities of data processing systems. Engineering a CSP system requires the combination of human and machine computation elements. From a general systems theory perspective, this means taking into account inherited as well as emerging properties from both these elements. In this paper, we position CSP systems within a broader taxonomy, outline a series of design principles and evaluation metrics, present an extensible framework for their design, and describe several design patterns. We showcase the capabilities of CSP systems by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Data Stream Mining Techniques · Anomaly Detection Techniques and Applications
