Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs

Ruoyu Wang; Xiaoxuan Li; Lina Yao

arXiv:2409.02686 · cs.CL · October 8, 2024

TL;DR

This paper introduces a causal framework for LLM reasoning and a parameter-efficient fine-tuning method, Deconfounded Causal Adaptation (DCA), that improves the reasoning abilities of large language models. DCA outperforms baseline methods on multiple benchmarks while using only 1.2 million parameters.

Contribution

The paper proposes a novel causal framework for understanding LLM reasoning and introduces DCA, a PEFT method that enhances reasoning skills with only 1.2 million parameters.
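To put the 1.2 million figure in perspective, the sketch below computes what share of a model's weights that number represents. The 7-billion-parameter base model is an assumption chosen only for illustration; this page does not state the base model size. The `low_rank_adapter_params` helper counts parameters for a generic LoRA-style adapter, included purely as a point of comparison. It is not the DCA architecture, and its hidden size, rank, and matrix count are hypothetical.

```python
def trainable_fraction(trainable_params: int, total_params: int) -> float:
    """Return the share of parameters that are updated during fine-tuning."""
    if total_params <= 0:
        raise ValueError("total_params must be positive")
    return trainable_params / total_params


def low_rank_adapter_params(hidden_size: int, rank: int, adapted_matrices: int) -> int:
    """Parameter count of generic LoRA-style adapters (NOT the DCA design).

    Each adapted square weight matrix gets two low-rank factors of shape
    (hidden_size, rank) and (rank, hidden_size).
    """
    return 2 * hidden_size * rank * adapted_matrices


# Figure reported on this page for DCA.
DCA_TRAINABLE = 1_200_000
# Assumption: a 7-billion-parameter base model, for illustration only.
ASSUMED_BASE = 7_000_000_000

if __name__ == "__main__":
    share = trainable_fraction(DCA_TRAINABLE, ASSUMED_BASE)
    print(f"DCA share of the assumed base model: {share:.4%}")

    # Hypothetical comparison: rank-8 adapters on 64 matrices of width 4096.
    generic = low_rank_adapter_params(hidden_size=4096, rank=8, adapted_matrices=64)
    print(f"Generic low-rank adapter parameters: {generic:,}")
```

Under these assumptions the tuned parameters amount to well under a tenth of a percent of the base model, which is the sense in which such methods are called parameter-efficient.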

Findings

01 DCA outperforms baseline methods on multiple benchmarks.

02 DCA achieves comparable or better results with significantly fewer parameters.

03 DCA enhances the reasoning capabilities and reliability of LLMs.

Abstract

Large Language Models (LLMs) have demonstrated remarkable efficiency in tackling various tasks based on human instructions, but studies reveal that they often struggle with tasks requiring reasoning, such as math or physics. This limitation raises questions about whether LLMs truly comprehend embedded knowledge or merely learn to replicate the token distribution without a true understanding of the content. In this paper, we delve into this problem and aim to enhance the reasoning capabilities of LLMs. First, we investigate if the model has genuine reasoning capabilities by visualizing the text generation process at the attention and representation level. Then, we formulate the reasoning process of LLMs into a causal framework, which provides a formal explanation of the problems observed in the visualization. Finally, building upon this causal framework, we propose Deconfounded Causal…

Taxonomy

Topics: AI-based Problem Solving and Planning

Methods: Softmax · Attention Is All You Need