Online and Offline Analysis of Streaming Data
Sheik Hoque, Andriy Miranskyy

TL;DR
This paper introduces a scalable, maintainable 7-layer architecture combining online and offline streaming data analysis using microservices, publish-subscribe, and persistent storage, unifying previously separate approaches.
Contribution
It presents a novel 7-layer architecture that integrates online and offline streaming data analysis within a single scalable framework.
Findings
Design ensures high cohesion and low coupling.
Supports asynchronous communication between layers.
Facilitates unified online and offline data analysis.
Abstract
Online and offline analytics have been traditionally treated separately in software architecture design, and there is no existing general architecture that can support both. Our objective is to go beyond and introduce a scalable and maintainable architecture for performing online as well as offline analysis of streaming data. In this paper, we propose a 7-layered architecture utilising microservices, publish-subscribe pattern, and persistent storage. The architecture ensures high cohesion, low coupling, and asynchronous communication between the layers, thus yielding a scalable and maintainable solution. This design can help practitioners to engage their online and offline use cases in one single architecture, and also is of interest to academics, as it is a building block for a general architecture supporting data analysis.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
