Learning state machines via efficient hashing of future traces

Robert Baumgartner; Sicco Verwer

arXiv:2207.01516·cs.FL·July 5, 2022

Learning state machines via efficient hashing of future traces

Robert Baumgartner, Sicco Verwer

PDF

Open Access 1 Repo

TL;DR

This paper introduces a memory-efficient method for learning state machines from streaming data by combining count-min-sketch data structures with state merging techniques, enabling scalable analysis of large datasets.

Contribution

It presents a novel approach that integrates count-min-sketch with state merging to efficiently learn state machines from data streams, reducing memory usage and maintaining result quality.

Findings

01

Effective in reducing memory requirements

02

Maintains high quality of learned state machines

03

Demonstrates improved run-time performance

Abstract

State machines are popular models to model and visualize discrete systems such as software systems, and to represent regular grammars. Most algorithms that passively learn state machines from data assume all the data to be available from the beginning and they load this data into memory. This makes it hard to apply them to continuously streaming data and results in large memory requirements when dealing with large datasets. In this paper we propose a method to learn state machines from data streams using the count-min-sketch data structure to reduce memory requirements. We apply state merging using the well-known red-blue-framework to reduce the search space. We implemented our approach in an established framework for learning state machines, and evaluated it on a well know dataset to provide experimental data, showing the effectiveness of our approach with respect to quality of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tudelft-cda-lab/flexfringe
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Machine Learning and Algorithms