Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?
Chanhui Lee (1), Juhyeon Kim (2), Yongjun Jeong (3), Juhyun Lyu (4),, Junghee Kim (4), Sangmin Lee (4), Sangjun Han (4), Hyeokjun Choe (4), Soyeon, Park (4), Woohyung Lim (4), Sungbin Lim (5,6), Sanghack Lee (2,7) ((1), Department of Artificial Intelligence, Korea University

TL;DR
This paper investigates the limitations of using pre-trained language models for causal discovery and proposes a new framework that combines PLM-derived prior knowledge with causal discovery algorithms to improve causal inference accuracy.
Contribution
It identifies the limitations of PLMs in causal discovery and introduces a novel framework that integrates prior knowledge from PLMs into causal discovery algorithms.
Findings
Empirical demonstration of PLM limitations in causal reasoning.
Proposed framework improves causal discovery performance.
Guidelines for leveraging PLM-extracted knowledge in causal algorithms.
Abstract
Scaling laws have allowed Pre-trained Language Models (PLMs) into the field of causal reasoning. Causal reasoning of PLM relies solely on text-based descriptions, in contrast to causal discovery which aims to determine the causal relationships between variables utilizing data. Recently, there has been current research regarding a method that mimics causal discovery by aggregating the outcomes of repetitive causal reasoning, achieved through specifically designed prompts. It highlights the usefulness of PLMs in discovering cause and effect, which is often limited by a lack of data, especially when dealing with multiple variables. Conversely, the characteristics of PLMs which are that PLMs do not analyze data and they are highly dependent on prompt design leads to a crucial limitation for directly using PLMs in causal discovery. Accordingly, PLM-based causal reasoning deeply depends on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management
