Increasing the Accessibility of Causal Domain Knowledge via Causal   Information Extraction Methods: A Case Study in the Semiconductor   Manufacturing Industry

Houssam Razouk; Leonie Benischke; Daniel Garber; Roman Kern

arXiv:2411.10172·cs.CL·February 28, 2025

Increasing the Accessibility of Causal Domain Knowledge via Causal Information Extraction Methods: A Case Study in the Semiconductor Manufacturing Industry

Houssam Razouk, Leonie Benischke, Daniel Garber, Roman Kern

PDF

Open Access

TL;DR

This paper develops and evaluates automated causal information extraction methods tailored for industrial documents in semiconductor manufacturing, demonstrating high accuracy and practical applicability.

Contribution

It introduces two novel sequence tagging methods, SST and MST, optimized for semi-structured industrial documents, with a focus on real-world application performance.

Findings

01

MST achieves 93% F1 on FMEA documents

02

MST achieves 73% F1 on presentation slides

03

Domain-aligned language models improve extraction performance

Abstract

The extraction of causal information from textual data is crucial in the industry for identifying and mitigating potential failures, enhancing process efficiency, prompting quality improvements, and addressing various operational challenges. This paper presents a study on the development of automated methods for causal information extraction from actual industrial documents in the semiconductor manufacturing industry. The study proposes two types of causal information extraction methods, single-stage sequence tagging (SST) and multi-stage sequence tagging (MST), and evaluates their performance using existing documents from a semiconductor manufacturing company, including presentation slides and FMEA (Failure Mode and Effects Analysis) documents. The study also investigates the effect of representation learning on downstream tasks. The presented case study showcases that the proposed MST…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRough Sets and Fuzzy Logic