How susceptible are LLMs to Logical Fallacies?

Amirreza Payandeh; Dan Pluth; Jordan Hosier; Xuesu Xiao; Vijay K.; Gurbani

arXiv:2308.09853·cs.CL·January 3, 2025·2 cites

How susceptible are LLMs to Logical Fallacies?

Amirreza Payandeh, Dan Pluth, Jordan Hosier, Xuesu Xiao, Vijay K., Gurbani

PDF

Open Access 1 Repo

TL;DR

This paper introduces LOGICOM, a benchmark to evaluate LLMs' robustness against logical fallacies in debates, revealing that GPT-3.5 and GPT-4 are often misled by fallacious arguments.

Contribution

The paper presents LOGICOM, a novel diagnostic benchmark for assessing LLMs' susceptibility to logical fallacies in argumentative debates.

Findings

01

GPT-3.5 and GPT-4 can change opinions through reasoning

02

Both models are misled by fallacies 41% and 69% more often

03

A new dataset of 5,000 logical vs. fallacious argument pairs is provided

Abstract

This paper investigates the rational thinking capability of Large Language Models (LLMs) in multi-round argumentative debates by exploring the impact of fallacious arguments on their logical reasoning performance. More specifically, we present Logic Competence Measurement Benchmark (LOGICOM), a diagnostic benchmark to assess the robustness of LLMs against logical fallacies. LOGICOM involves two agents: a persuader and a debater engaging in a multi-round debate on a controversial topic, where the persuader tries to convince the debater of the correctness of its claim. First, LOGICOM assesses the potential of LLMs to change their opinions through reasoning. Then, it evaluates the debater's performance in logical reasoning by contrasting the scenario where the persuader employs logical fallacies against one where logical reasoning is used. We use this benchmark to evaluate the performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Amir-pyh/LOGICOM
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multi-Agent Systems and Negotiation

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Position-Wise Feed-Forward Layer · Label Smoothing · Linear Layer · Softmax · Weight Decay · Absolute Position Encodings