"Let's Argue Both Sides": Argument Generation Can Force Small Models to   Utilize Previously Inaccessible Reasoning Capabilities

Kaveh Eskandari Miandoab; Vasanth Sarathy

arXiv:2410.12997·cs.CL·October 18, 2024

"Let's Argue Both Sides": Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities

Kaveh Eskandari Miandoab, Vasanth Sarathy

PDF

Open Access

TL;DR

This paper introduces Argument Generation, a method that enhances small language models' reasoning abilities by generating and ranking arguments, outperforming traditional prompting techniques especially in smaller models.

Contribution

The paper presents Argument Generation as a novel prompting approach that improves reasoning in small models without added complexity, highlighting its effectiveness over chain-of-thought methods.

Findings

01

Argument Generation improves reasoning in small models.

02

It outperforms chain-of-thought prompting in zero-shot settings.

03

Smaller models benefit more significantly from this approach.

Abstract

Large Language Models (LLMs), despite achieving state-of-the-art results in a number of evaluation tasks, struggle to maintain their performance when logical reasoning is strictly required to correctly infer a prediction. In this work, we propose Argument Generation as a method of forcing models to utilize their reasoning capabilities when other approaches such as chain-of-thought reasoning prove insufficient. Our method involves the generation of arguments for each possible inference result, and asking the end model to rank the generated arguments. We show that Argument Generation can serve as an appropriate substitute for zero-shot prompting techniques without the requirement to add layers of complexity. Furthermore, we argue that knowledge-probing techniques such as chain-of-thought reasoning and Argument Generation are only useful when further reasoning is required to infer a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Software Engineering Research