Exploring Self-supervised Logic-enhanced Training for Large Language   Models

Fangkai Jiao; Zhiyang Teng; Bosheng Ding; Zhengyuan Liu; Nancy F.; Chen; Shafiq Joty

arXiv:2305.13718·cs.CL·June 18, 2024·2 cites

Exploring Self-supervised Logic-enhanced Training for Large Language Models

Fangkai Jiao, Zhiyang Teng, Bosheng Ding, Zhengyuan Liu, Nancy F., Chen, Shafiq Joty

PDF

Open Access 2 Repos

TL;DR

This paper introduces LogicLLM, a self-supervised logic-enhanced training method for large language models, improving their logical reasoning capabilities beyond traditional supervised fine-tuning.

Contribution

It proposes a novel self-supervised post-training approach using a logic-oriented proxy task, enhancing LLMs' reasoning abilities without supervised data.

Findings

01

LogicLLM outperforms baseline models on logical reasoning benchmarks.

02

Self-supervised logic training improves LLM reasoning capabilities.

03

Extensive ablations identify key factors in logic proxy task design.

Abstract

Existing efforts to improve logical reasoning ability of language models have predominantly relied on supervised fine-tuning, hindering generalization to new domains and/or tasks. The development of Large Langauge Models (LLMs) has demonstrated the capacity of compressing abundant knowledge into a single proxy, enabling them to tackle multiple tasks effectively. Our preliminary experiments, nevertheless, show that LLMs do not show capability on logical reasoning. The performance of LLMs on logical reasoning benchmarks is far behind the existing state-of-the-art baselines. In this paper, we make the first attempt to investigate the feasibility of incorporating logical knowledge through self-supervised post-training, and activating it via in-context learning, which we termed as LogicLLM. Specifically, we devise an auto-regressive objective variant of MERIt and integrate it with two LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsFlan-T5