Can We Utilize Pre-trained Language Models within Causal Discovery   Algorithms?

Chanhui Lee (1); Juhyeon Kim (2); Yongjun Jeong (3); Juhyun Lyu (4),; Junghee Kim (4); Sangmin Lee (4); Sangjun Han (4); Hyeokjun Choe (4); Soyeon; Park (4); Woohyung Lim (4); Sungbin Lim (5,6); Sanghack Lee (2,7) ((1); Department of Artificial Intelligence; Korea University; (2) Graduate School; of Data Science; Seoul National University; (3) Department of Computer; Science; Engineering; UNIST; (4) Data Intelligence Laboratory; LG AI; Research; (5) Department of Statistics; Korea University; (6) LG AI Research,; (7) SNU-LG AI Research Center)

arXiv:2311.11212·cs.AI·November 21, 2023·1 cites

Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?

Chanhui Lee (1), Juhyeon Kim (2), Yongjun Jeong (3), Juhyun Lyu (4),, Junghee Kim (4), Sangmin Lee (4), Sangjun Han (4), Hyeokjun Choe (4), Soyeon, Park (4), Woohyung Lim (4), Sungbin Lim (5,6), Sanghack Lee (2,7) ((1), Department of Artificial Intelligence, Korea University

PDF

Open Access

TL;DR

This paper investigates the limitations of using pre-trained language models for causal discovery and proposes a new framework that combines PLM-derived prior knowledge with causal discovery algorithms to improve causal inference accuracy.

Contribution

It identifies the limitations of PLMs in causal discovery and introduces a novel framework that integrates prior knowledge from PLMs into causal discovery algorithms.

Findings

01

Empirical demonstration of PLM limitations in causal reasoning.

02

Proposed framework improves causal discovery performance.

03

Guidelines for leveraging PLM-extracted knowledge in causal algorithms.

Abstract

Scaling laws have allowed Pre-trained Language Models (PLMs) into the field of causal reasoning. Causal reasoning of PLM relies solely on text-based descriptions, in contrast to causal discovery which aims to determine the causal relationships between variables utilizing data. Recently, there has been current research regarding a method that mimics causal discovery by aggregating the outcomes of repetitive causal reasoning, achieved through specifically designed prompts. It highlights the usefulness of PLMs in discovering cause and effect, which is often limited by a lack of data, especially when dealing with multiple variables. Conversely, the characteristics of PLMs which are that PLMs do not analyze data and they are highly dependent on prompt design leads to a crucial limitation for directly using PLMs in causal discovery. Accordingly, PLM-based causal reasoning deeply depends on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management