LLM4Causal: Democratized Causal Tools for Everyone via Large Language   Model

Haitao Jiang; Lin Ge; Yuhe Gao; Jianian Wang; Rui Song

arXiv:2312.17122·cs.CL·October 29, 2024·1 cites

LLM4Causal: Democratized Causal Tools for Everyone via Large Language Model

Haitao Jiang, Lin Ge, Yuhe Gao, Jianian Wang, Rui Song

PDF

Open Access

TL;DR

This paper introduces LLM4Causal, an open-source fine-tuned large language model designed to perform causal inference tasks, interpret results, and provide accessible causal reasoning tools for users, enhancing LLM capabilities in structured causal reasoning.

Contribution

The work presents a novel fine-tuning approach with new datasets enabling LLMs to identify causal tasks, execute causal functions, and interpret results, making causal reasoning more accessible.

Findings

01

LLM4Causal outperforms baselines in causal problem solving

02

It provides clear, interpretable causal explanations

03

End-to-end causal inference is effectively achieved

Abstract

Large Language Models (LLMs) have shown their success in language understanding and reasoning on general topics. However, their capability to perform inference based on user-specified structured data and knowledge in corpus-rare concepts, such as causal decision-making is still limited. In this work, we explore the possibility of fine-tuning an open-sourced LLM into LLM4Causal, which can identify the causal task, execute a corresponding function, and interpret its numerical results based on users' queries and the provided dataset. Meanwhile, we propose a data generation process for more controllable GPT prompting and present two instruction-tuning datasets: (1) Causal-Retrieval-Bench for causal problem identification and input parameter extraction for causal function calling and (2) Causal-Interpret-Bench for in-context causal interpretation. By conducting end-to-end evaluations and two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Attention Dropout · Linear Layer · Adam · Discriminative Fine-Tuning · Dense Connections · Linear Warmup With Cosine Annealing · Weight Decay