Towards a Generic Multimodal Architecture for Batch and Streaming Big Data Integration
Siham Yousfi, Maryem Rhanoui, Dalila Chiadmi

TL;DR
This paper proposes a generic multimodal lambda architecture that integrates batch and streaming processing to analyze heterogeneous Big Data sources in near-real-time, improving urban traffic congestion detection.
Contribution
It introduces a novel architecture combining batch and streaming layers for multimodal data, addressing data reliability and completeness in Big Data integration.
Findings
Effective in urban traffic congestion detection
Integrates heterogeneous data sources in real-time
Enhances data reliability assessment
Abstract
Big Data are rapidly produced from various heterogeneous data sources. They are of different types (text, image, video or audio) and have different levels of reliability and completeness. One of the most interesting architectures that deal with the large amount of emerging data at high velocity is called the lambda architecture. In fact, it combines two different processing layers namely batch and speed layers, each providing specific views of data while ensuring robustness, fast and scalable data processing. However, most papers dealing with the lambda architecture are focusing one single type of data generally produced by a single data source. Besides, the layers of the architecture are implemented independently, or, at best, are combined to perform basic processing without assessing either the data reliability or completeness. Therefore, inspired by the lambda architecture, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
