Abstraction-of-Thought Makes Language Models Better Reasoners

Ruixin Hong; Hongming Zhang; Xiaoman Pan; Dong Yu; Changshui Zhang

arXiv:2406.12442·cs.CL·September 27, 2024·1 cites

Abstraction-of-Thought Makes Language Models Better Reasoners

Ruixin Hong, Hongming Zhang, Xiaoman Pan, Dong Yu, Changshui Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces Abstraction-of-Thought (AoT), a novel reasoning format that enhances language models' ability to perform abstract reasoning, leading to improved performance on complex tasks compared to traditional methods.

Contribution

The paper proposes AoT, a new structured reasoning format, along with a large finetuning dataset, demonstrating significant performance improvements on challenging reasoning benchmarks.

Findings

01

AoT outperforms Chain-of-Thought in many reasoning tasks

02

A new dataset of 348k AoT samples was created for finetuning

03

Models trained with AoT show improved generalization on Big-Bench Hard

Abstract

Abstract reasoning, the ability to reason from the abstract essence of a problem, serves as a key to generalization in human reasoning. However, eliciting language models to perform reasoning with abstraction remains unexplored. This paper seeks to bridge this gap by introducing a novel structured reasoning format called Abstraction-of-Thought (AoT). The uniqueness of AoT lies in its explicit requirement for varying levels of abstraction within the reasoning process. This approach could elicit language models to first contemplate on the abstract level before incorporating concrete details, which is overlooked by the prevailing step-by-step Chain-of-Thought (CoT) method. To align models with the AoT format, we present AoT Collection, a generic finetuning dataset consisting of 348k high-quality samples with AoT reasoning processes, collected via an automated and scalable pipeline. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

raising-hrx/abstraction-of-thought
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies

MethodsALIGN