VectraFlow: Long-Horizon Semantic Processing over Data and Event Streams with LLMs
Shu Chen, Junhan Liu, Deepti Raghavan, Ugur Cetintemel

TL;DR
VectraFlow is a semantic streaming engine that enables long-horizon, stateful reasoning over unstructured text streams using LLMs, combining traditional CEP with semantic operators for real-time analysis.
Contribution
It introduces a novel semantic dataflow engine that extends CEP with LLM-powered operators for unstructured text streams, supporting complex event pattern detection.
Findings
Supports configurable throughput-accuracy tradeoffs.
Enables real-time semantic pipeline composition.
Demonstrates temporal pattern detection over clinical texts.
Abstract
Monitoring continuous data for meaningful signals increasingly demands long-horizon, stateful reasoning over unstructured streams. However, today's LLM frameworks remain stateless and one-shot, and traditional Complex Event Processing (CEP) systems, while capable of temporal pattern detection, assume structured, typed event streams that leave unstructured text out of reach. We demonstrate VectraFlow, a semantic streaming dataflow engine, to address both gaps. VectraFlow extends traditional relational operators with LLM-powered execution over free-text streams, offering a suite of continuous semantic operators -- filter, map, aggregate, join, group-by, and window -- each with configurable throughput-accuracy tradeoffs across LLM-based, embedding-based, and hybrid implementations. Building on this, a semantic event pattern operator lifts complex event processing to unstructured document…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
