Using GPT-4 to guide causal machine learning
Anthony C. Constantinou, Neville K. Kitson, Alessio Zanga

TL;DR
This paper evaluates GPT-4's ability to identify causal relationships from label-only data, demonstrating its potential to enhance causal discovery when combined with traditional causal machine learning methods.
Contribution
It shows that GPT-4 can effectively assist in causal graph inference, improving the accuracy of causal ML algorithms and addressing trust issues in causal modeling.
Findings
GPT-4 graphs judged most accurate by participants
Pairing GPT-4 with causal ML improves causal graph quality
GPT-4's causal reasoning enhances traditional causal discovery methods
Abstract
Since its introduction to the public, ChatGPT has had an unprecedented impact. While some experts praised AI advancements and highlighted their potential risks, others have been critical about the accuracy and usefulness of Large Language Models (LLMs). In this paper, we are interested in the ability of LLMs to identify causal relationships. We focus on the well-established GPT-4 (Turbo) and evaluate its performance under the most restrictive conditions, by isolating its ability to infer causal relationships based solely on the variable labels without being given any other context by humans, demonstrating the minimum level of effectiveness one can expect when it is provided with label-only information. We show that questionnaire participants judge the GPT-4 graphs as the most accurate in the evaluated categories, closely followed by knowledge graphs constructed by domain experts, with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI) · Machine Learning and Algorithms
MethodsAttention Is All You Need · Label Smoothing · Adam · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Position-Wise Feed-Forward Layer · Dense Connections · Multi-Head Attention
