Reconstructing temporal multi-relational firm networks at scale using large language models. The case of the semiconductor industry
Seyda K\"ose, Christian Diem, Elma Dervic, Klaus Friesenbichler, Georg Heiler, Jan Hurt, Hernan Picatto, Peter Klimek

TL;DR
This paper presents a novel method using large language models and open web data to reconstruct and analyze the complex, dynamic network of firms in the semiconductor industry at scale.
Contribution
It introduces a generalizable approach combining LLMs with web data to map and analyze multi-relational firm networks, overcoming limitations of proprietary databases.
Findings
Reconstructed a network of over 1,300 firms from 170 million webpages.
Validated link extraction with high precision and F1-score.
Detected industry shifts such as a 9% decline in connections during the 2022 chip shortage.
Abstract
The semiconductor industry is foundational to modern technology, yet its complex global multi-relational firm network remains poorly understood, posing challenges to scientists, firms, and policymakers. Traditional analysis relies on proprietary databases that are often expensive, incomplete, and slowly updated, limiting their ability to capture rapidly evolving dependencies. Here, we demonstrate that a novel, generalizable methodology combining Large Language Models (LLMs) with open web data can reconstruct this network and its structural dynamics at scale. We identify and classify supply-chain, partnership, and ownership links from 170 million semiconductor firm webpages, yielding a temporal network of over 1,300 linked firms. We validate link-extraction quality (Precision: 0.884; F1-score: 0.784), network overlap and complementarity with a proprietary database, and consistency with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
