Fast Data Management with Distributed Streaming SQL

Milinda Pathirage; Beth Plale

arXiv:1511.03935 · cs.DB · November 13, 2015 · 1 citation

TL;DR

This paper proposes a standard SQL-based streaming query model to improve fast data management in distributed stream processing systems, addressing current limitations in responsiveness and flexibility.

Contribution

It identifies a set of requirements and proposes a standard SQL-based streaming query model for managing fast data in distributed stream processing systems.

Findings

01 Identified key requirements for SQL-based stream querying

02 Proposed a standard streaming query model for fast data

03 Highlighted the lack of responsiveness to change in the low-level programming models of current stream processing systems

Abstract

To stay competitive in today's data-driven economy, enterprises large and small are turning to stream processing platforms to process high-volume, high-velocity, and diverse streams of data (fast data) as they arrive. The low-level programming models provided by today's popular systems suffer from a lack of responsiveness to change: enhancements require code changes, with attendant long turnaround times. Even though distributed SQL query engines have been available for Big Data, we still lack support for SQL-based stream querying in distributed stream processing systems. In this white paper, we identify a set of requirements and propose a standard SQL-based streaming query model for the management of what has been referred to as Fast Data.

Taxonomy

Topics: Advanced Database Systems and Queries · Advanced Data Storage Technologies · Distributed and Parallel Computing Systems