Link Prediction for Event Logs in the Process Industry
Anastasia Zhukova, Thomas Walton, Christian E. Lobm\"uller, Bela Gipp

TL;DR
This paper presents a novel record linking model for event logs in the process industry, combining NLP techniques to improve data connectivity and support better decision-making.
Contribution
It adapts cross-document coreference resolution with NLI and semantic similarity for link prediction in fragmented process industry logs, outperforming baseline models.
Findings
The model outperformed baselines by over 27% in link prediction accuracy.
Combining NLP tasks improves data quality in German process industry logs.
The approach enhances knowledge management and operational decision-making.
Abstract
In the era of graph-based retrieval-augmented generation (RAG), link prediction is a significant preprocessing step for improving the quality of fragmented or incomplete domain-specific data for the graph retrieval. Knowledge management in the process industry uses RAG-based applications to optimize operations, ensure safety, and facilitate continuous improvement by effectively leveraging operational data and past insights. A key challenge in this domain is the fragmented nature of event logs in shift books, where related records are often kept separate, even though they belong to a single event or process. This fragmentation hinders the recommendation of previously implemented solutions to users, which is crucial in the timely problem-solving at live production sites. To address this problem, we develop a record linking model, which we define as a cross-document coreference resolution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
