NLP4PBM: A Systematic Review on Process Extraction using Natural Language Processing with Rule-based, Machine and Deep Learning Methods
William Van Woensel, Soroor Motie

TL;DR
This review analyzes automated process extraction from text using NLP, highlighting the rise of ML/DL methods, challenges with datasets, and emerging use of LLMs for improved extraction accuracy.
Contribution
It systematically compares rule-based, ML, DL, and LLM approaches for process extraction, emphasizing current challenges and future directions.
Findings
ML/DL methods often outperform rule-based approaches
Lack of large annotated datasets hampers progress
Emerging LLM applications show promising results
Abstract
This literature review studies the field of automated process extraction, i.e., transforming textual descriptions into structured processes using Natural Language Processing (NLP). We found that Machine Learning (ML) / Deep Learning (DL) methods are being increasingly used for the NLP component. In some cases, they were chosen for their suitability towards process extraction, and results show that they can outperform classic rule-based methods. We also found a paucity of gold-standard, scalable annotated datasets, which currently hinders objective evaluations as well as the training or fine-tuning of ML / DL methods. Finally, we discuss preliminary work on the application of LLMs for automated process extraction, as well as promising developments in this field.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Process Modeling and Analysis · Big Data and Business Intelligence
