Structurally Valid Log Generation using FSM-GFlowNets
Riya Samanta

TL;DR
This paper introduces a novel framework combining FSMs and GFlowNets to generate structurally valid, diverse, and realistic synthetic event logs, improving over existing methods in coherence and controllability.
Contribution
The paper presents a new FSM-constrained GFlowNet approach for generating valid and diverse event logs, with a hybrid reward system and dynamic action masking, applicable across symbolic sequence domains.
Findings
Produces realistic, structurally consistent logs with low divergence metrics.
Outperforms GPT-4o and Gemini in distributional similarity metrics.
Synthetic logs enable effective downstream classification tasks.
Abstract
Generating structurally valid and behaviorally diverse synthetic event logs for interaction-aware models is a challenging yet crucial problem, particularly in settings with limited or privacy constrained user data. Existing methods such as heuristic simulations and LLM based generators often lack structural coherence or controllability, producing synthetic data that fails to accurately represent real world system interactions. This paper presents a framework that integrates Finite State Machines or FSMs with Generative Flow Networks or GFlowNets to generate structured, semantically valid, and diverse synthetic event logs. Our FSM-constrained GFlowNet ensures syntactic validity and behavioral variation through dynamic action masking and guided sampling. The FSM, derived from expert traces, encodes domain-specific rules, while the GFlowNet is trained using a flow matching objective with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Machine Learning in Healthcare · Business Process Modeling and Analysis
