Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models
Mingzi Cao, Xingwei Tan, Mahmud Elahi Akhter, Marco Valentino, Maria Liakata, Xi Wang, Nikolaos Aletras

TL;DR
This paper explores how fundamental reasoning paradigms like deduction, induction, and abduction influence large language models' ability to generalize out-of-domain, using a new dataset and various training methods to enhance reasoning skills.
Contribution
It introduces a dataset of reasoning trajectories for core paradigms and systematically evaluates methods to induce these reasoning skills into LLMs for better out-of-domain generalization.
Findings
Inducing reasoning paradigms improves out-of-domain performance by up to 14.60 points.
Different training approaches, including fine-tuning and model transformation, effectively enhance reasoning capabilities.
The study demonstrates the importance of fundamental reasoning paradigms for robust LLM generalization.
Abstract
Deduction, induction, and abduction are fundamental reasoning paradigms, core for human logical thinking. Although improving Large Language Model (LLM) reasoning has attracted significant research efforts, the extent to which the fundamental paradigms induce generalization has yet to be systematically explored. In this study, we shed light on how the interplay between these core paradigms influences LLMs' reasoning behavior. To this end, we first collect a new dataset of reasoning trajectories from symbolic tasks, each targeting one of the three fundamental paradigms, to abstract from concrete world knowledge. Then, we investigate effective ways for inducing these skills into LLMs. We experiment with a battery of methods including simple fine-tuning, and more complex approaches to increase model depth, or transform a dense model to a mixture-of-experts. We comprehensively evaluate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
