# HyperStream: a Workflow Engine for Streaming Data

**Authors:** Tom Diethe, Meelis Kull, Niall Twomey, Kacper Sokol, Hao Song, Miquel, Perello-Nieto, Emma Tonkin, Peter Flach

arXiv: 1908.02858 · 2019-08-09

## TL;DR

HyperStream is a Python-based workflow engine designed for large-scale streaming data processing, enabling complex online and offline machine learning tasks with high flexibility and robustness.

## Contribution

It introduces a versatile, high-level Python framework that overcomes existing limitations in streaming data engines for machine learning applications.

## Key findings

- Supports complex nesting, fusion, and prediction in streaming environments
- Facilitates development and deployment of machine learning algorithms
- Provides comprehensive tools and documentation for users

## Abstract

This paper describes HyperStream, a large-scale, flexible and robust software package, written in the Python language, for processing streaming data with workflow creation capabilities. HyperStream overcomes the limitations of other computational engines and provides high-level interfaces to execute complex nesting, fusion, and prediction both in online and offline forms in streaming environments. HyperStream is a general purpose tool that is well-suited for the design, development, and deployment of Machine Learning algorithms and predictive models in a wide space of sequential predictive problems.   Source code, installation instructions, examples, and documentation can be found at: https://github.com/IRC-SPHERE/HyperStream.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.02858/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1908.02858/full.md

## References

9 references — full list in the complete paper: https://tomesphere.com/paper/1908.02858/full.md

---
Source: https://tomesphere.com/paper/1908.02858